Methods and kits for diagnosing and treating B-Cell Chronic Lymphocytic
Leukemia (B-CLL)

All patent and non-patent references cited in the present application are hereby
incorporated by reference in their entirety.

Field of invention

The present invention relates to methods and kits for detecting several polynucleotide
sequences found to be indicative of a poor prognosis of B-CLL. All the polynucleotides are
transcribed from a region on human chromosome 12q21-22. Most of the polynucleotides
do not encode larger polypeptides, but may encode small peptides or may function as
RNAs. Four polynucleotides encode a novel protein, which in one preferred embodiment
can be used as a cytokine, preferably as an interleukin. Furthermore, the invention relates
to methods and compositions for treating B-CLL, in particular poor prognosis B-CLL.

Background of Invention 

B-CLL is the most common form of leukaemia in Denmark, with more than 250 new cases
diagnosed every year. The disease results in accumulation of CD19+CD5+CD23+
lymphocytes in the blood, bone marrow and organs of the patients. B-CLL cells are
long-lived, slowly dividing and locked in the G1 phase of the cell cycle. At this time it is
unknown how or why B-CLL occurs and no cure is known for B-CLL. The application of more
aggressive treatment strategies has been hampered by the inability to identify
reproducible and reliable prognostic predictors in patients with poor outcome in this
disease. In many patients the diagnosis does not affect morbidity or mortality. Other
patients suffer from an incurable cancer that inevitably results in death, regardless of
treatment. Until recently this latter group of patients could not be identified at the time of
diagnosis. Recently, two studies established the mutational status of the immunoglobulin
variable region of the heavy chain (Ig VH) genes in B-CLL as independent prognostic
markers, within each clinical stage (Damle, et al. & Hamblin, et al.). Patients without
somatic hypermutation show much shorter survival than patients with somatic
hypermutation. FISH studies of cytogenetic aberrations in B-CLL established specific
abnormalities on chromosomes 11 (ATM), 12 (?), 13 (Leu-1 and -2) and 17 (p53) as
independent prognostic markers, within each clinical stage (Dohner, et al.). Very recent
studies have demonstrated that independent risk prediction, using a combined analysis of
Ig VH gene mutational analysis and cytogenetics, can identify subgroups of B-CLL with
median survivals ranging from less than 2.5 years to more than 15 years (Krober, et al.,
Lin, et al., & Oscier, et al.) (see Figure 1). Since the process of characterising the Ig VH
gene mutational status of an individual patient is cumbersome, it is desirable to provide
easier tests based on diagnostic markers for use in the differential diagnosis of such cancer
patients.

Summary of invention

It is an object of preferred embodiments of the present invention to provide differentially
expressed transcription products, which can be used as prognostic markers of disease and
give information about the differences in etiology between different groups of B-CLL
patients. These differentially expressed transcription products are genetic markers that can
be used in an easy assay to distinguish between subgroups of B-CLL patients and
especially identify B-CLL patients with a poor prognosis.

The method for diagnosing a subtype of B-cell chronic lymphocytic leukaemia (B-CLL)
comprises the steps of determining the presence or absence of at least one expression
product, such as a transcriptional product, which comprises a nucleotide sequence selected
from the group consisting of SEQ ID No:12, SEQ ID No:13, SEQ ID No:14, SEQ ID No:15,
SEQ ID No:16, SEQ ID No:17 and SEQ ID No:18 in a biological sample isolated from an
individual. As evidenced by the appended examples, the present inventors have
determined that the expression products of this invention are present in one subtype of
B-CLL having poor prognosis and are thus of great diagnostic value and independent
prognostic value. Equally important, an expression product comprising a nucleotide
sequence selected from the group consisting of SEQ ID No:12, SEQ ID No:13, SEQ ID No:14,
SEQ ID No:15, SEQ ID No:16, SEQ ID No:17 and SEQ ID No:18 has not been found in any of
the other tissue types tested (see e.g. Figure 8).

The vast majority of patients who show expression of the AMB-1 gene in the form of at least
one of the expression products selected from the group consisting of SEQ ID No:12, SEQ
ID No:13, SEQ ID No:14, SEQ ID No:15, SEQ ID No:16, SEQ ID No:17 and SEQ ID No:18
show unmutated Ig V(H) genes, which is consistent with poor prognosis B-CLL. The
presence of an expression product of the AMB-1 gene can be determined easily using
standard laboratory procedures and equipment. Therefore the diagnostic method provided
by the present inventors is easier than the determination of the mutation status of
Ig V(H) genes and can furthermore give additional information about the prognosis.

Accordingly, a further object of preferred embodiments of the present invention is a
method for determining the progress of B-CLL comprising determining the amount of at
least one expression product which comprises a nucleotide sequence selected from the
group consisting of SEQ ID No:12, SEQ ID No:13, SEQ ID No:14, SEQ ID No:15, SEQ ID
No:16, SEQ ID No:17 and SEQ ID No:18 in a biological sample isolated from an individual.
The method may be used e.g. for determining the efficiency of a treatment, i.e. to see
whether the amount of the expression product decreases or increases in response to a
curative treatment.

The expression products of the present invention are all transcripts of SEQ ID No:1 and/or
SEQ ID No:5, the gene of the present invention called AMB-1, which also encodes a novel
polypeptide (SEQ ID No:3).

A further object of preferred embodiments of the present invention is to provide a cure
and/or treatment of patients with B-CLL, in particular of patients with poor prognosis
B-CLL such as the sub-type of B-CLL which is characterised by the presence of an expression
product of the present invention.

The method for treating B-CLL comprises administering to an individual with a B-CLL
diagnosis a compound capable of decreasing or inhibiting the formation of an expression
product of SEQ ID No:1 and/or SEQ ID No:5. This expression product preferably
comprises a nucleotide sequence selected from the group consisting of SEQ ID No:12, SEQ
ID No:13, SEQ ID No:14, SEQ ID No:15, SEQ ID No:16, SEQ ID No:17 and SEQ ID No:18.
The present inventors believe that the presence of at least one of said expression products
is an etiological factor in B-CLL and that the disease can be treated or cured by inhibiting
the expression of at least one of such products and/or by inhibiting the effect of such
product, e.g. by rendering it inactive.

A further preferred object of embodiments of the present invention is to destroy or to
eliminate the transcription of at least one expression product comprising at least one
nucleotide sequence selected from the group consisting of SEQ ID No:12, SEQ ID No:13,
SEQ ID No:14, SEQ ID No:15, SEQ ID No:16, SEQ ID No:17 and SEQ ID No:18 for the
treatment of cancer, such as a poor prognosis sub-type of B-CLL.

The above destruction or elimination is obtained by applying polynucleotides or
oligonucleotides in the form of small interfering RNA molecules (siRNA), antisense
molecules or ribozymes.

In one aspect the invention relates to a gene therapy vector capable of inhibiting or
decreasing the formation of an expression product of SEQ ID No:1 and/or SEQ ID No:5,
said gene therapy vector preferably encoding a specific siRNA molecule, a specific
antisense molecule or a specific ribozyme being capable of decreasing or inhibiting the
formation of an expression product of SEQ ID No:1 and/or SEQ ID No:5. This gene
therapy vector can be used for treating B-CLL based on the finding that the AMB-1 gene
encoded by SEQ ID No:1 and/or SEQ ID No:5 is an etiological factor in B-CLL.

Both SEQ ID No 1, which is a 20,000 nucleotide long sequence, and SEQ ID No 5, which is an
80,000 nucleotide long sequence, provide several transcriptional products in B-CLL cells in
patients with poor prognosis B-CLL. Some of the transcriptional products, e.g. SEQ ID No 2
and SEQ ID No 4, consist of two exons (SEQ ID No:15 and SEQ ID No:16) separated by
the same intron. Both mRNA sequences contain an open reading frame (SEQ ID No:17)
encoding a 121 amino acid peptide (SEQ ID No 3).

Accordingly, yet another object of preferred embodiments of the present invention relates
to a novel class of polypeptides. These may be described as a group of isolated
polypeptides or proteins comprising or essentially consisting of the amino acid sequence of
SEQ ID No 3, or a fragment thereof, or a polypeptide functionally equivalent to SEQ ID
No 3, or a fragment thereof, wherein said fragment or functionally equivalent polypeptide
has at least 60% sequence identity with the polypeptide of SEQ ID No 3. The polypeptides
of the present invention may have interleukin or cytokine activity.
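
The 60% sequence identity criterion can be illustrated with a minimal sketch. The text does not specify how identity is to be computed, so the convention used here (identical residues divided by the number of aligned columns in which neither sequence has a gap) is an assumption, as are the function names and the requirement that the two sequences have already been aligned by some external tool.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over aligned columns where neither sequence has a gap.

    Assumption: both inputs come from a pairwise alignment and therefore
    have equal length; the choice of denominator is one common convention.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap or b == gap:
            continue  # skip gapped columns
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no aligned residue pairs to compare")
    return 100.0 * matches / compared


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 60.0) -> bool:
    """True if the aligned pair reaches the threshold (60% by default)."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

Other conventions (for example dividing by the full alignment length, gaps included) give lower values for the same alignment, so the convention should be fixed before any comparison against the 60% limit.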

In a still further aspect the invention relates to an isolated polynucleotide selected from
the group consisting of:

i) a polynucleotide comprising nucleotides of SEQ ID No 5,

ii) a polynucleotide encoding a polypeptide having the amino acid sequence of SEQ ID No
3,

iii) a polynucleotide, the complementary strand of which hybridises, under stringent
conditions, with a polynucleotide as defined in any of i) and ii),

iv) a polynucleotide which is degenerate to the polynucleotide of iii), and

v) the complementary strand of any such polynucleotide.

The polypeptides encoded by the polynucleotides may furthermore

a) have at least 60 % sequence identity with the amino acid sequence of SEQ ID No 3
and have interleukin or cytokine activity,

b) be recognised by an antibody, or a binding fragment thereof, which is capable of
recognising an epitope, wherein said epitope is comprised within a polypeptide having
the amino acid sequence of SEQ ID No 3; and/or

c) compete with a polypeptide having the amino acid sequence as shown in SEQ ID
No 3 for binding to at least one predetermined binding partner such as a cytokine
receptor.

One further therapeutic application of the present invention is a method of vaccination
against B-CLL, said method comprising immunising an individual against a translational
product of SEQ ID No:1 and/or SEQ ID No:5. By stimulating the immune system of an
individual to produce antibodies against the translational product the individual can
become immune towards B-CLL and/or the method can be used as part of therapy. The
state of the art describes various ways of immunising an individual against a particular
protein.

Finally, the invention provides a method for determining an increased or decreased
predisposition for B-CLL comprising determining in a biological sample from an individual a
germline alteration in a target nucleic acid sequence comprising 150,000 nucleotides, said
target nucleic acid sequence comprising at least 10 nucleotides of SEQ ID No:1 and/or SEQ
ID No:5. This aspect is based on the finding of the importance of the expression product of
SEQ ID No:1 and/or SEQ ID No:5, and the absence of any detectable expression product
of SEQ ID No:1 and/or SEQ ID No:5 in healthy tissue and in patients with good prognosis
B-CLL. It is highly likely that the difference is caused by a germline alteration. A germline
alteration can be targeted by gene therapy methods and by the methods provided in the
present invention.

Description of Drawings

Figure 1: Overall survival of B-CLL patients by genotype (all stages). The prognostic
significance of VH homology and cytogenetic aberrations is independent of clinical stage
(from Krober et al., 2002 (4)).

Figure 2 (a-d): Survival curves for survival or time to progression based on AMB-1
expression or IgVH mutational status respectively. Patients are newly diagnosed, untreated
B-CLL patients (n=34).

Figure 3: Northern blotting demonstrating the expression level of AMB-1 in various tissue
samples using as a probe an 875 base pair fragment of Exon 3/Seq ID No:16. UPN 1, 4
and 7 are unmutated B-CLL patients, UPN 19, 9, 10, 13 and 21 are mutated B-CLL
patients. Included are RNA samples from normal Colon, Spleen, Bone Marrow and PBL
(peripheral blood lymphocytes) and RNA from the Ramos and Granta cell lines. Equal
loading of lanes was confirmed by re-probing with an actin probe (results not shown).

Figure 4. Alignment of AMB1 with IL4 based on structural similarity. IL4 is called d1iara in
the alignment. The additional lines indicate the structural similarity.

Figure 5. A 3D search, where the peptide sequence has been searched for similarity to
known protein or peptide 3D-structures.

Figure 6. Predicted 3-D structure of AMB-1 compared to the known 3-D structure of human
IL4. Prediction is performed using SEQ ID No:3 and the method described in: Enhanced
Genome Annotation using Structural Profiles in the Program 3D-PSSM. Kelley LA,
MacCallum RM & Sternberg MJE (2000). J. Mol. Biol. 299(2), 499-520.

Figure 7. Alignment of the AMB1 peptide sequence with the sequences of IL4, IL3, IL13
and GM-CSF, based on their structures.

Figure 8. A table showing the tissue types on the MTE array used for dot blotting of AMB-1
to check for expression in other tissue types.

Figure 9. Schematic representation of the transcriptional products of the present invention
compared to the genomic AMB1 sequence (1 & 5). 2 is mRNA short form (SEQ ID No 2). 4
is mRNA long form (SEQ ID No 4). 6 (SEQ ID No:6), 7 (SEQ ID No:7), 8 (SEQ ID No:8), 9
(SEQ ID No:9), 10 (SEQ ID No:10) and 11 (SEQ ID No:11) are alternative transcription
products all comprising at least one nucleotide sequence selected from the group
consisting of SEQ ID No:12, SEQ ID No:13, SEQ ID No:14, SEQ ID No:15, SEQ ID No:16,
SEQ ID No:17 and SEQ ID No:18.

Figure 10. RT-PCR demonstrating the expression of AMB-1 in B-CLL patients.
UPN1-UPN8 are unmutated patients, UPN9-UPN16 are mutated patients.



WO 2004/046376
PCT/DK2003/000794

Detailed description of the invention 

The present invention in particular relates to polynucleotide sequences found to be
indicative of a poor prognosis of B-CLL and to new methods and compositions for treating
B-CLL, in particular poor prognosis B-CLL. An overview of the sequences disclosed by the
present invention is presented here:

SEQ ID No:1 includes 20,000 bp human genome sequence, derived from BAC clone acc.
no. AC063949. It includes the region encoding the mRNAs corresponding to the cDNAs
described in Seq ID Nos:2, 4, 6, 7, 8, 9, 10 and 11 and possible up- and down-stream
regulatory sequences. Seq ID No:1 is a subset of the sequence described in Seq ID
No:5. The sequence is derived from human 12q21-22.

SEQ ID No:2 includes a cDNA corresponding to a putative mRNA transcript that includes
the region encoding the peptide sequence in Seq ID No:3 and up- and downstream
regions. It is transcribed from the +strand of Seq ID No:1 and Seq ID No:5 from human
12q21-22. From position 2317, this sequence is identical to Seq ID No:4.

Seq ID No:3 contains a peptide sequence encoded by some mRNAs transcribed from the
region on human chromosome 12q21-22 included in Seq ID No:1 and Seq ID No:5. It is
encoded by the mRNA sequences identified as cDNAs in Seq ID Nos 02, 04, 09 and 11.

Seq ID No:4 includes a cDNA corresponding to a putative mRNA transcript that includes
the region encoding the peptide sequence in Seq ID No:3 and up- and downstream
regions. It is transcribed from the +strand of Seq ID No:1 and Seq ID No:5 from human
12q21-22.

Seq ID No:5 includes 80,000 bp human genome sequence, derived from BAC clone acc.
no. AC063949. It is an expansion of the genomic sequence included in Seq ID No:1, which
is contained within this sequence. It includes the region encoding the mRNAs
corresponding to the cDNAs described in Seq ID Nos 02, 04, 06, 07, 08, 09, 10 and 11 and
possible up- and down-stream regulatory sequences. The sequence is derived from human
12q21-22.

Seq ID No:6 corresponds to a cDNA detected by cDNA cloning, corresponding to an mRNA
transcript. It includes two exons. It is transcribed from the +strand of Seq ID No:1 and
Seq ID No:5 from human 12q21-22.

Seq ID No:7 corresponds to a cDNA detected by cDNA cloning, corresponding to an mRNA
transcript. It includes three exons, of which the first and third are identical to the two
exons in Seq ID No:6. It is transcribed from the +strand of Seq ID No:1 and Seq ID No:5
from human 12q21-22.

Seq ID No:8 corresponds to a cDNA detected by cDNA cloning, corresponding to an mRNA
transcript. It includes two exons, the last of which is also present as exon-2 in a human
cDNA clone (sequence acc. no. BC036936) (Seq ID No:9). It is transcribed from the +strand
of Seq ID No:5 from human 12q21-22.

Seq ID No:9 corresponds to a human cDNA sequence (sequence acc. no. BC036936). It is
transcribed from the +strand of Seq ID No:5 from human 12q21-22. We have not cloned
this cDNA, but a splice variant (Seq ID No:8), where exon-2 of this sequence was spliced to
exon-1 of Seq ID No:6, was detected by cDNA cloning.

Seq ID No:10 corresponds to a cDNA detected partly by cDNA cloning, partly by PCR
analysis, corresponding to an mRNA transcript. It includes two exons: exon-1 includes the
region encoding Seq ID No:3 and exon-1 from Seq ID No:6; exon-2 is identical to exon-2
in Seq ID No:6 and exon-3 in Seq ID No:7. It is transcribed from the +strand of Seq ID
No:1 and Seq ID No:5 from human 12q21-22.

Seq ID No:11 corresponds to a cDNA detected by cDNA cloning, corresponding to an
mRNA transcript. It includes one exon. The sequence includes the region that encodes
Seq ID No:3, exons 2 and 3 from Seq ID No:7 and the region between those exons. It is
transcribed from the +strand of Seq ID No:1 and Seq ID No:5 from human 12q21-22.

Seq ID No:12 is an exon sequence. It corresponds to the first exon in Seq ID No:9. It is
transcribed from the +strand of Seq ID No:5 from human 12q21-22.

Seq ID No:13 is an exon sequence. It corresponds to the first exon in Seq ID No:6, 7, 8,
and it is included in Seq ID No:2. It is transcribed from the +strand of Seq ID No:1 and 5
from human 12q21-22.

Seq ID No:14 is an exon sequence. It is identical to Seq ID No:13, but with an additional
GT dinucleotide at the 3' end, caused by the use of an alternative splice site. It can replace
Seq ID No:13 as the first exon in Seq ID No:6, 7, 8, and be included in Seq ID No:2. It is
transcribed from the +strand of Seq ID No:1 and 5 from human 12q21-22.

Seq ID No:15 is an exon sequence. It corresponds to the second exon in Seq ID No:7
and it is included in Seq ID No:2, 4, 10 and 11. It is transcribed from the +strand of Seq
ID No:1 and 5 from human 12q21-22.

Seq ID No:16 is an exon sequence. It corresponds to the third exon in Seq ID No:7, it is
the second exon in Seq ID No:6 and 11 and it is included in Seq ID No:2 and 4. It is
transcribed from the +strand of Seq ID No:5 from human 12q21-22.

Seq ID No:17 is the sequence encoding the peptide in Seq ID No:3. It is included in Seq
ID No:2, 4, 10 and 11. It is transcribed from the +strand of Seq ID No:5 from human
12q21-22.

Seq ID No:18 is an exon sequence. It corresponds to the second exon in Seq ID No:8
and 9. It is transcribed from the +strand of Seq ID No:5 from human 12q21-22.
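
As a reading aid, the membership relations stated in the paragraphs for Seq ID No:12 to No:18 can be collected in a small lookup table. The sketch below only transcribes what those seven paragraphs say; the table name and helper functions are hypothetical, and relations mentioned only in the transcript paragraphs (Seq ID No:2 to No:11) are deliberately not added.

```python
# Which transcripts (SEQ ID numbers) each short sequence is stated to be
# part of, taken only from the paragraphs for Seq ID No:12 to No:18.
SEQUENCE_IN_TRANSCRIPTS = {
    12: {9},
    13: {2, 6, 7, 8},
    14: {2, 6, 7, 8},       # alternative form of No:13 (extra GT at the 3' end)
    15: {2, 4, 7, 10, 11},
    16: {2, 4, 6, 7, 11},
    17: {2, 4, 10, 11},     # region encoding the peptide of Seq ID No:3
    18: {8, 9},
}


def transcripts_containing(sequence_ids):
    """Transcripts listed as containing at least one of the given sequences."""
    found = set()
    for sequence_id in sequence_ids:
        found |= SEQUENCE_IN_TRANSCRIPTS[sequence_id]
    return sorted(found)


def sequences_listed_for(transcript_id):
    """Short sequences (No:12 to No:18) listed as part of one transcript."""
    return sorted(
        sequence_id
        for sequence_id, transcripts in SEQUENCE_IN_TRANSCRIPTS.items()
        if transcript_id in transcripts
    )
```

For example, `transcripts_containing([15])` lists the transcripts that a probe directed at Seq ID No:15 would be expected to detect according to this overview.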



Methods of diagnosis

One important aspect of the present invention relates to diagnosis of a subtype of B-cell
chronic lymphocytic leukaemia (B-CLL) having poor prognosis. A further important aspect
of the invention relates to prognosis of B-CLL. These methods are based on the discovery
by the present inventors that an expression product which comprises at least one nucleotide
sequence selected from the group consisting of SEQ ID No:12, SEQ ID No:13, SEQ ID
No:14, SEQ ID No:15, SEQ ID No:16, SEQ ID No:17 and SEQ ID No:18 is (are) only
present in particular subtypes of B-CLL associated with poor prognosis and completely
absent in other subtypes of B-CLL and in healthy tissue (see in particular example 2). By
completely absent is meant that the expression products are not detected in any of the
other tissue types with the methods used in the appended examples. This is indicative of a
complete absence of any transcript or a very low level of transcript in the other tissue
types.

The expression product is encoded by SEQ ID No 1 and/or SEQ ID No 5, and the
expression product is selected from the group consisting of transcriptional products and
translational products.

Thus, the present invention relates to a method for detecting the presence or absence of
at least one expression product, wherein the at least one expression product comprises a
nucleotide sequence selected from the group consisting of SEQ ID No:12, SEQ ID No:13,
SEQ ID No:14, SEQ ID No:15, SEQ ID No:16, SEQ ID No:17 and SEQ ID No:18, in a
biological sample isolated from an individual for establishing a differential diagnosis of
B-CLL or for determining the prognosis of the B-CLL.

"Expression product" is herein meant to be a product which is the result of the expression
of a polynucleotide such as a DNA sequence, e.g. a genomic DNA sequence, and is in the
form of either a polypeptide or a polynucleotide, i.e. an expression product
can be selected from the group consisting of a transcriptional product and a translational
product. In the case where the expression product is a polynucleotide, said polynucleotide
is preferably selected from the group consisting of mRNA, pre-mRNA and pre-pro-mRNA.

A "transcriptional product" or a "transcription product" is herein meant to be a product
resulting from a transcription of a polynucleotide such as a DNA molecule, preferably a
genomic DNA molecule. A transcriptional product is inherently a nucleotide, such as an
oligonucleotide or a polynucleotide.




A "translational product" or a "translation product" is herein meant to be a product
resulting from a translation of a transcriptional product such as an mRNA. A translational
product is inherently an oligopeptide or a polypeptide.

The expression product of the present invention has almost exclusively been found as
transcription products in patients with poor B-CLL prognosis. Based on the experimental
data presented herein, the inventors expect that the subtype of B-CLL will turn out to be
characterised solely, or better, by the presence of a transcriptional or translational
product which comprises a sequence selected from the group consisting of SEQ ID No:3,
SEQ ID No:12, SEQ ID No:13, SEQ ID No:14, SEQ ID No:15, SEQ ID No:16, SEQ ID No:17
and SEQ ID No:18.

Preferably the individual is a mammal, more preferably a human. It is also expected that
the gene encoded by SEQ ID No 1 and/or SEQ ID No 5 and the expression products
derived from said gene can be used as a diagnostic tool in other species, in particular in
mammals selected from the group: domestic animals such as cow, horse, sheep, pig; and
pets such as cat or dog.

In the case that the expression product is a transcriptional product, this transcriptional
product just needs to comprise at least one of the nucleotide sequences selected from the
group consisting of SEQ ID No:12, SEQ ID No:13, SEQ ID No:14, SEQ ID No:15, SEQ ID
No:16, SEQ ID No:17 and SEQ ID No:18. These transcriptional products will preferably be
derived from SEQ ID No 1 and/or SEQ ID No 5 and may be in the form of mRNA or any
pre- or pro-forms of said mRNA. As described, the transcriptional product may comprise at
least one of the nucleotide sequences of said group, such as one, two, 3, 4 or 5 of said
nucleotide sequences, and ultimately the transcriptional product may comprise all of the
six nucleotide sequences selected from said group. The transcriptional product of the
present invention can have any sequence which is a result of combining the nucleotide
sequences of said group, as long as the specific nucleotide sequences can be identified as
intact sequences in the transcriptional product.

Examples of transcriptional products in the form of specific mRNAs which comprise at least
one of the nucleotide sequences selected from the group consisting of SEQ ID No:12, SEQ
ID No:13, SEQ ID No:14, SEQ ID No:15, SEQ ID No:16, SEQ ID No:17 and SEQ ID No:18
are the sequences corresponding to SEQ ID No 2 (short cDNA clone), SEQ ID No 4 (long
cDNA clone), SEQ ID No 6, SEQ ID No 7, SEQ ID No 8, SEQ ID No 9, SEQ ID No 10 and
SEQ ID No 11. These mRNA sequences have been found in patients with poor prognosis.

It is obvious for a person skilled in the art that any fragments of SEQ ID No:2, SEQ ID
No:4, SEQ ID No:6, SEQ ID No:7, SEQ ID No:8, SEQ ID No:9, SEQ ID No:10 and SEQ ID
No:11 will have the same diagnostic value as long as the nucleotide sequences selected
from the group consisting of SEQ ID No:12, SEQ ID No:13, SEQ ID No:14, SEQ ID No:15,
SEQ ID No:16, SEQ ID No:17 and SEQ ID No:18 can be found in the fragments.

The mRNA sequence may be detected in a sample using hybridisation techniques. In
particular when more than one analysis is to be performed at the same time it is
advantageous to use a DNA array comprising e.g. an oligomer of at least 15 consecutive
bases selected from the group consisting of SEQ ID No:2, SEQ ID No:4, SEQ ID No:6, SEQ
ID No:7, SEQ ID No:8, SEQ ID No:9, SEQ ID No:10, SEQ ID No:11, SEQ ID No:12, SEQ ID
No:13, SEQ ID No:14, SEQ ID No:15, SEQ ID No:16, SEQ ID No:17 and SEQ ID No:18.
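
The notion of an oligomer of at least 15 consecutive bases can be sketched as a simple window enumeration. The function names are hypothetical, any sequence passed in is the caller's own input (none of the listed sequences is reproduced here), and probe design criteria such as melting temperature or cross-hybridisation are outside this sketch.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def candidate_probes(sequence: str, length: int = 15):
    """List (start, oligomer) for every window of `length` consecutive bases.

    `length` must be at least 15, following the minimum stated in the text.
    A sequence shorter than `length` yields an empty list.
    """
    if length < 15:
        raise ValueError("oligomer length must be at least 15 bases")
    seq = sequence.upper()
    return [
        (start, seq[start:start + length])
        for start in range(len(seq) - length + 1)
    ]


def reverse_complement(oligomer: str) -> str:
    """Reverse complement of a DNA oligomer (A, C, G, T only)."""
    return oligomer.upper().translate(_COMPLEMENT)[::-1]
```

Whether the array carries the window itself or its reverse complement depends on which strand of the labelled sample is hybridised, which the text leaves open.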

Another way of detecting the presence or absence of the transcriptional product is by
specifically amplifying the transcriptional product having a sequence corresponding to
SEQ ID No 2, 4, 6, 7, 8, 9, 10 or 11 or fragments thereof. This can be done by selecting
primer pairs which cause only the amplification of these sequences.

Generally, hybridisation techniques are selected from, but not limited to, the group
consisting of in situ hybridisation, northern blots, Southern blots, dot blots and PCR based
techniques.

A non-limiting list of PCR based techniques includes RT-PCR, quantitative PCR and
real-time PCR.

According to yet another embodiment, the translational product is a protein encoded by a
polynucleotide selected from the group consisting of SEQ ID No:1, SEQ ID No:5, SEQ ID
No:2, SEQ ID No:4, SEQ ID No:6, SEQ ID No:7, SEQ ID No:8, SEQ ID No:9, SEQ ID No:10
and SEQ ID No:11. Detection of this protein can be done with state of the art methods
including the detection with an antibody directed against said protein, such as Western
blotting, more preferably by using a fluorescently labelled antibody, preferably wherein the
method comprises the use of flow cytometry, such as FACS. Other methods include but are
not limited to gel electrophoresis, gel filtration, ion exchange chromatography, FPLC, mass
spectrometry and immunohistochemistry.

Preferably, said protein is selected from the group comprising SEQ ID No 3 (protein), or a
protein sharing at least 60 % sequence identity with SEQ ID No 3. The protein with the
amino acid sequence set forth in SEQ ID No 3 is encoded by the longest open reading frame
in the cDNA sequence of SEQ ID No 2 or 4.
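
How a longest open reading frame might be located in a cDNA can be sketched as follows. This is a minimal illustration, not the procedure used by the inventors: it assumes a standard ATG start codon, the three standard stop codons, and a search restricted to the three reading frames of the given strand. The function name is hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def longest_orf(cdna: str):
    """Return (start, end) of the longest ATG-to-stop ORF on the given strand.

    `end` is exclusive and includes the stop codon. Returns None when no
    complete ORF is found. Only the three forward frames are scanned.
    """
    seq = cdna.upper().replace("U", "T")
    best = None
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first start codon opens the ORF
            elif start is not None and codon in STOP_CODONS:
                if best is None or (i + 3 - start) > (best[1] - best[0]):
                    best = (start, i + 3)
                start = None  # look for the next ORF in this frame
    return best
```

The encoded peptide length in amino acids is `(end - start) // 3 - 1`, since the stop codon is not translated.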


A specific embodiment of the present invention is a method for determining whether an
individual has a B-CLL sub-type with poor prognosis, the method comprising determining
the level of an expression product which comprises a nucleotide sequence selected from the
group consisting of SEQ ID No:12, SEQ ID No:13, SEQ ID No:14, SEQ ID No:15, SEQ ID
No:16, SEQ ID No:17 and SEQ ID No:18 of said individual, and indicating the individual as
having a B-CLL sub-type with poor prognosis if the level of the expression product is at or
beyond a discriminating value and indicating the individual as not having a B-CLL sub-type
with poor prognosis if the level of the expression product is not at or beyond the
discriminating value, the discriminating value being a value which has been determined by
measuring the level of the expression product which comprises a nucleotide sequence
selected from the group consisting of SEQ ID No:12, SEQ ID No:13, SEQ ID No:14, SEQ ID
No:15, SEQ ID No:16, SEQ ID No:17 and SEQ ID No:18 in both a healthy control
population and a population with known B-CLL sub-type with poor prognosis, thereby
determining said discriminating value which identifies the B-CLL sub-type population
having a poor prognosis.
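
One way a discriminating value could be derived from the two reference populations is sketched below. The text does not state a selection rule, so the criterion used here (the cutoff that maximises sensitivity plus specificity, with "at or beyond" read as greater than or equal) is an assumption, as are the function names; any expression levels passed in are the caller's own measurements.

```python
def discriminating_value(control_levels, poor_prognosis_levels):
    """Pick the cutoff maximising sensitivity + specificity - 1.

    Candidate cutoffs are the observed levels. On ties the lowest
    candidate is kept. This selection rule is an assumption.
    """
    if not control_levels or not poor_prognosis_levels:
        raise ValueError("both reference populations must be non-empty")
    candidates = sorted(set(control_levels) | set(poor_prognosis_levels))
    best_cutoff = None
    best_score = -1.0
    for cutoff in candidates:
        sensitivity = (sum(x >= cutoff for x in poor_prognosis_levels)
                       / len(poor_prognosis_levels))
        specificity = (sum(x < cutoff for x in control_levels)
                       / len(control_levels))
        score = sensitivity + specificity - 1.0
        if score > best_score:
            best_cutoff, best_score = cutoff, score
    return best_cutoff


def has_poor_prognosis_subtype(level, cutoff):
    """True if the measured level is at or beyond the discriminating value."""
    return level >= cutoff
```

With fully separated populations the chosen cutoff is simply the lowest level seen in the poor prognosis group; with overlapping populations the trade-off between false positives and false negatives would need to be set deliberately.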

In this method the individual may be a member of an unselected population or be a
member of a population already identified as having a B-CLL sub-type with a poor
prognosis.

The above method may be performed such that the determination is performed at several
time points at intervals as part of a monitoring of a cancer patient after or during the
treatment for primary cancer.

The methods described so far relate to the determination of the presence or absence of an
expression product which comprises a nucleotide sequence selected from the group
consisting of SEQ ID No:12, SEQ ID No:13, SEQ ID No:14, SEQ ID No:15, SEQ ID No:16,
SEQ ID No:17 and SEQ ID No:18. By quantitatively measuring the amount of an
expression product which comprises a nucleotide sequence selected from the group
consisting of SEQ ID No:12, SEQ ID No:13, SEQ ID No:14, SEQ ID No:15, SEQ ID No:16,
SEQ ID No:17 and SEQ ID No:18 in a biological sample isolated from an individual, it is
possible to predict the progression of B-CLL in an individual.

In one embodiment the quantitative measurement Is performed during treatment to 
40 estimate the efficiency of such treatment. 

A preferred embodiment of the above diagnostic and prognostic methods is a method for
detecting the presence or absence of at least one expression product, wherein said at
least one expression product comprises the nucleotide sequence of SEQ ID No: 15, in a
biological sample isolated from an individual for establishing a differential diagnosis of
B-CLL or for determining the prognosis of the B-CLL.

A further preferred embodiment of the above diagnostic and prognostic methods is a
method for detecting the presence or absence of at least one expression product, wherein
said at least one expression product comprises the nucleotide sequence of SEQ ID No: 16,
in a biological sample isolated from an individual for establishing a differential
diagnosis of B-CLL or for determining the prognosis of the B-CLL.

Yet a further preferred embodiment of the above diagnostic and prognostic methods is a
method for detecting the presence or absence of at least one expression product, wherein
said at least one expression product comprises the nucleotide sequence spanning the
junction sequence between Exon-2 (SEQ ID No: 15) and Exon-3 (SEQ ID No: 16), in a
biological sample isolated from an individual for establishing a differential diagnosis of
B-CLL or for determining the prognosis of the B-CLL.

The nucleotide sequence spanning the junction between Exon-2 and Exon-3 is the last 20
nucleotides of the 3'-end of SEQ ID No: 15 and the first 20 nucleotides of the 5'-end of SEQ
ID No: 16.

The "junction sequence" between two nucleotide sequences, such as two exons, is herein
defined as the at least 20 3'-nucleotides of the first exon, which is located 5' relative to
the second exon, and the at least 20 5'-nucleotides of the second exon, which is located 3'
relative to the first exon.
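
The definition of a junction sequence given above can be sketched as follows. The
function name and the placeholder exon strings are hypothetical and merely stand in for
SEQ ID No: 15 and SEQ ID No: 16, which are not reproduced here; the flank length of 20
is the minimum stated in the definition.

```python
def junction_sequence(upstream_exon, downstream_exon, flank=20):
    """Join the last `flank` nucleotides of the 5'-located exon with the
    first `flank` nucleotides of the 3'-located exon."""
    if flank < 20:
        raise ValueError("the definition requires at least 20 nucleotides per exon")
    if len(upstream_exon) < flank or len(downstream_exon) < flank:
        raise ValueError("each exon must be at least `flank` nucleotides long")
    return upstream_exon[-flank:] + downstream_exon[:flank]


# Placeholder sequences only (NOT the real exon sequences).
exon2 = "A" * 30 + "C" * 20
exon3 = "G" * 20 + "T" * 30

junction = junction_sequence(exon2, exon3)
print(junction)
```

A probe or primer covering such a junction would be expected to detect only transcripts
in which the two exons are spliced together.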


For all diagnostic applications of the present invention, the biological sample may be
selected from the group comprising blood, serum, plasma, urine, saliva, lymph node
biopsy, bone marrow, spinal liquid, spleen biopsy, and liver biopsy. The cells to be
assessed in a sample are preferably leukocytes, mononuclear leukocytes or lymphocytes or
B-lymphocytes.

A further embodiment of the present invention also includes a diagnostic kit for ex vivo or
in situ diagnosis of a subtype of B-cell chronic lymphocytic leukaemia (B-CLL) in an
individual, the kit comprising a detector molecule capable of detecting the presence or
absence of at least one expression product, wherein said at least one expression product
comprises a nucleotide sequence selected from the group consisting of SEQ ID No: 12,
SEQ ID No: 13, SEQ ID No: 14, SEQ ID No: 15, SEQ ID No: 16, SEQ ID No: 17 and
SEQ ID No: 18, in a biological sample isolated from the individual.

The detector molecule is preferably a nucleotide and even more preferably a nucleotide
capable of hybridising to a nucleotide sequence selected from the group consisting of
SEQ ID No: 12, SEQ ID No: 13, SEQ ID No: 14, SEQ ID No: 15, SEQ ID No: 16, SEQ ID
No: 17 and SEQ ID No: 18 under stringent conditions.

B-CLL therapy

With the identification of a new sub-type of B-CLL having a poor prognosis, the present
inventors also provide methods for treatment of B-CLL in such patients. These methods are
based on the finding that transcription products comprising these sequences are
present in B-CLL cells of patients with the poor prognosis. By modifying the activity and/or
level of these transcription products, a treatment and/or cure for B-CLL is provided.

Accordingly, in a therapeutic aspect of the present invention there is provided a method of
treating a B-CLL sub-type with poor prognosis comprising administering to an individual
with a poor prognosis B-CLL diagnosis a compound capable of decreasing or inhibiting the
formation of an expression product which comprises a nucleotide sequence selected from
the group consisting of SEQ ID No: 12, SEQ ID No: 13, SEQ ID No: 14, SEQ ID No: 15, SEQ
ID No: 16, SEQ ID No: 17 and SEQ ID No: 18.

One such method is based on administering an oligonucleotide capable of inhibiting
transcription from SEQ ID No 1 and/or SEQ ID No 5. Said oligonucleotide may comprise
at least 8-10 consecutive nucleotides from the sequence of SEQ ID No 1. These sequences
constitute the putative promoter sequences controlling the transcription of transcription
products which comprise a nucleotide sequence selected from the group consisting of SEQ
ID No: 12, SEQ ID No: 13, SEQ ID No: 14, SEQ ID No: 15, SEQ ID No: 16, SEQ ID No: 17 and
SEQ ID No: 18. The oligonucleotides bind specifically to the promoter sequences and inhibit
transcription of the gene. Such oligonucleotides may comprise nucleotide monomers
selected from the group: DNA, RNA, LNA, PNA, methylated DNA, methylated RNA, more
preferably PNA or LNA.

In a more preferred embodiment the therapeutic methods comprise administering an
oligonucleotide capable of binding to a transcriptional product and preventing translation
by destroying the transcriptional product. One particularly preferred embodiment of this
aspect is RNA interference (RNAi) oligonucleotides.

The discovery of the phenomenon RNAi has revealed an entirely new level of gene
regulation in eukaryotic cells. It is based on the observation that the presence of long
double stranded RNA (dsRNA) in a cell almost completely eliminates the expression of the
gene having the same sequence, whereas expression of other unrelated genes is left
undisturbed. Although this observation had been known for some time in plants as
posttranscriptional gene silencing (PTGS), it was not before it was characterised as a
general mechanism throughout the animal kingdom that its potential was fully
appreciated. Over the last few years it has been developed as a robust technique to knock
down any desirable gene in worms and flies, and quickly a large body of information was
gathered about the function of genes in these organisms. Due to the activation of the
interferon system by long dsRNA the RNAi method was at that time not applicable in a
mammalian system.

A key observation that allowed the harnessing of RNAi as a tool for regulating gene
expression in mammals was the observation that chemically synthesised oligo-mer small
interfering RNAs (siRNA) effectively suppress gene expression in several human cell lines
without inducing an interferon response. This has triggered new promises for siRNA as a
therapeutic drug in humans.

RNAi works by hybridising specifically to the mRNA transcribed by the cell to form a
(partly) double stranded RNA molecule. This is recognised as a double stranded molecule
by the cell's own nucleases, which degrade it.

In order for the technique to work efficiently, the siRNA oligonucleotide comprises a
sequence of 5-30 consecutive nucleotides which is the complementary sequence of the
nucleotide sequences selected from the group consisting of SEQ ID No: 12, SEQ ID No: 13,
SEQ ID No: 14, SEQ ID No: 15, SEQ ID No: 16, SEQ ID No: 17 and SEQ ID No: 18. By
targeting at least one of the nucleotide sequences selected from the group consisting of
SEQ ID No: 12, SEQ ID No: 13, SEQ ID No: 14, SEQ ID No: 15, SEQ ID No: 16, SEQ ID No: 17
and SEQ ID No: 18, the transcriptional products characterised by the nucleotide sequences
from the group of SEQ ID No: 2, SEQ ID No: 4, SEQ ID No: 6, SEQ ID No: 7, SEQ ID No: 8,
SEQ ID No: 9, SEQ ID No: 10 and SEQ ID No: 11 will be eliminated. Example 5 shows that
cells characteristic for the poor prognosis B-CLL sub-type can be eliminated by destroying
the herein mentioned transcription products.
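
How complementary sequences of a chosen length can be derived from a target sequence
may be sketched as follows. This is an illustrative enumeration under stated assumptions,
not a design procedure of the present invention: the function names are hypothetical,
only Watson-Crick pairing is considered, the default window of 21 nucleotides is an
arbitrary choice within the stated 5-30 range, and no selection for efficacy or
specificity is attempted.

```python
# Watson-Crick complement table for RNA.
_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def antisense_rna(target_rna):
    """Return the reverse complement (5'->3') of an RNA target."""
    return target_rna.translate(_COMPLEMENT)[::-1]


def candidate_antisense_windows(target, length=21):
    """List every antisense stretch of `length` consecutive nucleotides
    that is complementary to the target (given as DNA or RNA letters)."""
    if not 5 <= length <= 30:
        raise ValueError("length must lie in the 5-30 nucleotide range")
    rna = target.upper().replace("T", "U")
    return [
        antisense_rna(rna[i:i + length])
        for i in range(len(rna) - length + 1)
    ]


# Short placeholder target (NOT one of the SEQ ID sequences).
print(candidate_antisense_windows("ATGCATGC", length=5))
```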

By "complementarity" is meant that a nucleic acid can form hydrogen bond(s) with another
DNA or RNA sequence by either traditional Watson-Crick or other non-traditional types of
base-paired interactions, e.g. Hoogsteen type.

Preferred siRNA molecules of the present invention are between 5 and 30 nucleotides long,
such as 8-30 nucleotides long, such as 8-25 nucleotides, e.g. 8-24 nucleotides, e.g. 8-23
nucleotides, e.g. 8-22 nucleotides, e.g. 8-21 nucleotides, such as 8-20 nucleotides, e.g.
9-23 nucleotides, e.g. 10-23 nucleotides, such as 11-23 nucleotides, e.g. 12-23 nucleotides,
such as 13-23 nucleotides, e.g. 14-23 nucleotides, e.g. 15-23 nucleotides, such as 16-23
nucleotides, such as 8 nucleotides, 9 nucleotides, 10 nucleotides, 11 nucleotides, 12
nucleotides, 13 nucleotides, 14 nucleotides, 15 nucleotides, 16 nucleotides, 17 nucleotides,
18 nucleotides, 19 nucleotides, 20 nucleotides, 21 nucleotides, 22 nucleotides, 23
nucleotides, 24 nucleotides, 25 nucleotides, 26 nucleotides, 27 nucleotides, 28 nucleotides,
29 nucleotides, such as 30 nucleotides long.

RNAi oligonucleotides may be administered to the cell, or a vector may be transfected into
the cells, said vector comprising a promoter region capable of directing the expression of
at least one RNAi oligonucleotide. Due to the very restricted expression of the AMB-1 gene,
it is not essential to target the RNAi oligos or the vectors only to B-CLL cells.

One way of targeting blood cells comprises using a heparin receptor.

Another way of targeting the transcriptional products which comprise at least one
nucleotide sequence selected from the group consisting of SEQ ID No: 12, SEQ ID No: 13,
SEQ ID No: 14, SEQ ID No: 15, SEQ ID No: 16, SEQ ID No: 17 and SEQ ID No: 18 is to use
an antisense construct comprising a promoter sequence capable of directing the
transcription of at least part of the antisense equivalent of SEQ ID No 1 or 2 or 4 or 5 or 6
or 7 or 8 or 9 or 10 or 11 specifically to the poor prognosis B-CLL sub-type.

When desired, targeting to B-CLL cells can be performed using the CD19 or CD20 receptor.
The CD19 receptor is particularly preferred since it internalises its ligand.

In a further therapeutic embodiment the compound is a gene therapy vector comprising a
promoter sequence operably linked to a sequence coding for a protein capable of inhibiting
cell division in the cell and/or capable of killing the cell, said promoter sequence being a
tissue specific promoter capable of directing expression only in B cells, more preferably
only in B-CLL cells. One particularly preferred promoter sequence is the extremely cell
specific promoter of SEQ ID No: 1 or SEQ ID No: 5. When this promoter is used, targeting of
the suicide vector is not very important, since it will only be active in the cells in which
AMB-1 is expressed and these are the cells to be targeted by the suicide gene.

Deletion studies will determine the exact length of the promoter sequence counted from
the transcription start site. Accordingly, the promoter may comprise at least 100
nucleotides of SEQ ID No: 1 or SEQ ID No: 5, such as at least 200 nucleotides, for example
at least 300 nucleotides, such as at least 400 nucleotides, for example at least 500
nucleotides, such as at least 600 nucleotides, for example at least 700 nucleotides, such as
at least 800 nucleotides, for example at least 900 nucleotides, such as at least 1000
nucleotides, for example at least 1100 nucleotides, such as at least 1200 nucleotides, for
example at least 1300 nucleotides, such as at least 1400 nucleotides, for example at least
1500 nucleotides, such as at least 1600 nucleotides, for example at least 1700 nucleotides,
such as at least 1800 nucleotides, for example at least 1900 nucleotides, such as at least
2000 nucleotides, for example at least 2500 nucleotides, such as at least 3000 nucleotides,
for example at least 3500 nucleotides, such as at least 5000 nucleotides, for example at
least 10,000 nucleotides.

The specificity of expression of the mRNAs described by the present invention is striking.
The RT-PCR data and the Northern blot data using the dot-blot disclose that the mRNAs of
the present invention are expressed either at very low levels in other tissues or only in
the B-CLL patients where one can detect them.

Thus one embodiment of the present invention is the use of the promoter region in
gene therapy. The promoter is defined as any sequence within SEQ ID No: 1 and SEQ ID
No: 5 that directs the formation of an expression product which comprises a nucleotide
sequence selected from the group consisting of SEQ ID No: 12, SEQ ID No: 13, SEQ ID
No: 14, SEQ ID No: 15, SEQ ID No: 16, SEQ ID No: 17 and SEQ ID No: 18, said expression
product being any of the sequences selected from the group consisting of SEQ ID No: 2,
SEQ ID No: 4, SEQ ID No: 6, SEQ ID No: 7, SEQ ID No: 8, SEQ ID No: 9, SEQ ID No: 10 and
SEQ ID No: 11 in B-CLL cells or any other cell or tissue types in which any of the sequences
are transcribed.

One way that one can use the promoter region in gene therapy is to make a gene therapy
construct where the promoter drives the expression of a cell suicide gene such as, but not
limited to, the gene for HSV-1 thymidine kinase, the varicella-zoster virus thymidine
kinase gene, E. coli cytosine deaminase, the nitroreductase gene or the E. coli Deo gene
(Yazawa et al. & Kirn D. et al.). This would allow for selective expression of the suicide
genes in B-CLL cells.

Alternatively, the promoter could be used for a selective expression of genes that could
have curative effects when expressed in B-CLL cells, but unwanted effects if expressed
ubiquitously.

Also, one embodiment of the present invention relates to the use of the promoter region
in screening assays where the promoter is linked to a reporter gene and
transfected into B-CLL cells in which the reporter gene will be expressed. This approach
would allow for easy screening for compounds that would turn off the expression of the
reporter gene, for example by killing the cell.

A presently preferred embodiment relates to a gene therapy vector of the present invention
comprising an oligonucleotide capable of inhibiting transcription from SEQ ID No 1 and/or
SEQ ID No 5, wherein the promoter is a B-CLL specific promoter, which may or may not be
operably linked to a protein selected from the group comprising HSV-1 thymidine kinase,
the varicella-zoster virus thymidine kinase gene, E. coli cytosine deaminase, the
nitroreductase gene or the E. coli Deo gene.

In one embodiment the compound is a therapeutic antibody directed against a polypeptide
having the amino acid sequence of SEQ ID No 3, preferably wherein said antibody is a
human or humanised antibody. Another possibility is to identify a modulator of binding of
SEQ ID No 3 to its receptor within or outside the cell and to administer this modulator to
the cells.

4-helical cytokines

A further object of preferred embodiments of the present invention is an isolated
polypeptide comprising an amino acid sequence selected from the group consisting of:

i) an amino acid sequence of SEQ ID NO: 3,

ii) an amino acid sequence having at least 60% sequence identity compared to
the full length sequence of SEQ ID NO: 3,

iii) a fragment of SEQ ID NO: 3 having at least 60% sequence identity compared
to the full length sequence of SEQ ID NO: 3.

The protein encoded by SEQ ID No 1 and/or SEQ ID No 5 shares very little sequence
identity with any known protein. However, it has been possible to use 2D and 3D analytical
tools to identify the protein as a 4-helical cytokine. The 3D structure of the protein is very
similar to 4-helical cytokines and in particular to IL4.

IL4 is a very important cytokine in B-CLL biology. IL4 is not expressed by B-CLL cells, but
the IL4 receptor is found on the cells. The IL4 that stimulates B-CLL cells is believed to be
produced by T-lymphocytes. The role of IL4 in B-CLL biology is complicated. It has been
suggested that IL4 can inhibit B-CLL DNA synthesis and proliferation. Other reports
demonstrated that IL4 protects B-CLL cells from apoptosis by upregulating Bcl-2, and IL4
was shown to inhibit apoptosis without stimulating proliferation. Recently, a clinical study
in Sweden has confirmed these in vitro studies since IL4 administration to B-CLL patients
resulted in increased numbers of B-CLL cells in the blood, suggesting that IL4 had a
stimulatory or anti-apoptotic effect on the B-CLL cells in vivo (Lundin, et al.).

In many systems the effects of IL13 are largely similar to those of IL4, but IL13 is slightly
less potent than IL4. It is unclear whether B-CLL cells express IL13, but the cells do
express the IL13 receptor. The effects of IL13 in B-CLL are controversial. While Chaouchi
et al. suggested that IL13, like IL4, protects B-CLL cells from apoptosis (Chaouchi et al.),
studies by Fluckiger et al. suggest that this is not the case (Fluckiger et al.).

The combined finding of 2D and 3D structure similarity to 4-helical cytokines and the
importance of IL4 in B-CLL strongly suggests that the novel class of proteins of which the
AMB-1 protein is one representative are cytokines.

These polypeptides constitute a novel class of proteins sharing 2D and 3D structure
similarities with 4-helical cytokines. In a preferred embodiment, the isolated polypeptide
comprises or essentially consists of the amino acid sequence of SEQ ID No. 3 or a
fragment thereof. This particular protein at least can be used for diagnosis, for raising
antibodies for use in therapy against B-CLL, and for protective or therapeutic immunisation
of an individual against B-CLL.

Consequently, the isolated polypeptide preferably has interleukin activity or interleukin-like
activity, such as having IL3, IL13, GM-CSF, TGF-β, IGF activity, more preferably having
IL4 activity or IL4-like activity.

Probably the isolated polypeptides are capable of forming homo- or hetero-oligomers with
each other. Such oligomers are also within the scope of the present
invention. Such oligomers may comprise at least one isolated polypeptide as defined in
the present invention, and may be a dimer, a trimer, a tetramer, a pentamer, a
hexamer, an octamer, a decamer or a dodecamer. In biological systems the activity may be
attributed only to dimers or higher-order oligomers.

The protein defined by SEQ ID No 3 shares very little sequence identity with known
cytokines and interleukins and, as a matter of fact, very little sequence identity with any
known protein. Consequently the present inventors contemplate that the group comprises
functionally equivalent polypeptides sharing at least 60% sequence identity with SEQ ID No
3, more preferably at least 70% sequence identity, more preferably at least 80%
sequence identity, such as at least 90% sequence identity, for example at least 95%
sequence identity, such as at least 97% sequence identity, for example at least 98%
sequence identity.

Activity as a cytokine or interleukin can be assessed in a biological assay where the
polypeptide is contacted with a cytokine dependent cell line. Accordingly, polypeptides
with cytokine or interleukin-like activity can also be identified by similar methods.

One approach to assess cytokine/interleukin activity in a biological assay is to express the
CDS (SEQ ID No: 17) reading frame in a baculovirus system (Invitrogen, Carlsbad, USA)
and purify the protein. The recombinant protein can be assayed in cytokine induced
proliferation assays as described in general in the eBioscience Catalog & Reference Manual
2002 p. 260-262 (eBioscience, San Diego, USA). In particular IL4 activity can be
determined using the CTh4S cell line as described by Petersen et al. (see Example 8).

The promoter sequence (which forms part of SEQ ID No 1 and/or SEQ ID No 5) and the
coding sequences (SEQ ID No: 3) can be used in various aspects of gene therapy and
immunotherapy.

Further polynucleotide sequences from other individuals or other species with the same
function can be isolated by one of the following methods, which each form independent
aspects of the present invention.

A first method for identifying a nucleotide sequence encoding a 4-helical cytokine
comprises the steps of:

i) isolating mRNA from a biological sample,

ii) hybridising the mRNA to a probe comprising at least 10 nucleotides of the coding
sequence of SEQ ID No 1 and/or SEQ ID No 5 under stringent conditions,

iii) determining the nucleotide sequence of a sequence capable of hybridising under step
ii), and

iv) determining the presence of an open reading frame in the nucleotide sequence
determined under step iii).

A second method for identifying a nucleotide sequence encoding a 4-helical cytokine is a
computer assisted method comprising the steps of:



wo 2004/046376 



PCT/DK2003/000794 




i) performing a sequence similarity search of at least 10 nucleotides of the coding
sequence of SEQ ID No 1 and/or SEQ ID No 5,

ii) aligning "hits" to said coding sequence,

iii) determining the presence of an open reading frame in the "hits".
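
The final step of both methods, determining the presence of an open reading frame, may
be sketched as follows. This is a simplified illustration under stated assumptions: the
function name is hypothetical, only the three forward reading frames of the given strand
are scanned, an open reading frame is taken to run from an ATG start codon to the first
in-frame stop codon, and the standard stop codons TAA, TAG and TGA are assumed.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def longest_orf(dna):
    """Return the longest ATG...stop open reading frame found in the three
    forward frames of `dna`, or an empty string if there is none."""
    seq = dna.upper()
    best = ""
    for frame in range(3):
        i = frame
        while i + 3 <= len(seq):
            if seq[i:i + 3] != "ATG":
                i += 3
                continue
            # Walk codon by codon from the start codon to the first stop.
            j = i
            while j + 3 <= len(seq):
                if seq[j:j + 3] in STOP_CODONS:
                    orf = seq[i:j + 3]
                    if len(orf) > len(best):
                        best = orf
                    break
                j += 3
            # Resume scanning after the stop codon (or end the frame).
            i = j + 3
    return best


# Placeholder "hit" sequence (NOT taken from the SEQ ID listing).
print(longest_orf("ATGTAAATGCCCGGGTGA"))
```

A complete search would also scan the reverse complement strand, which is omitted here
for brevity.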

It is highly likely that other similar polynucleotides encoding further 4-helical cytokines
can be found in other individuals and/or other species of mammals. In particular,
individuals of other geographical origin may carry genes which differ from the
polynucleotides of the present invention. It is also conceivable that similar sequences can
be found in closely and even in distantly related species.

Functional Equivalents 

Modifications and changes may be made in the structure of the peptides of the present
invention and DNA segments which encode them and still obtain a functional molecule that
encodes a protein or peptide with desirable characteristics. The following is a discussion
based upon changing the amino acids of a protein to create an equivalent, or even an
improved, second-generation molecule. The amino acid changes may be achieved by
changing the codons of the DNA sequence, according to the genetic code.

For example, certain amino acids may be substituted for other amino acids in a protein
structure without appreciable loss of interactive binding capacity with structures such as,
for example, antigen-binding regions of antibodies, binding sites of receptors, or binding
sites on substrate molecules. Since it is the interactive capacity and nature of a protein
that defines that protein's biological functional activity, certain amino acid sequence
substitutions can be made in a protein sequence, and, of course, its underlying DNA coding
sequence, and nevertheless obtain a protein with like properties. It is thus contemplated
by the inventors that various changes may be made in the peptide sequences of the
disclosed compositions, or corresponding DNA sequences which encode said peptides,
without appreciable loss of their biological utility or activity.

In making such changes, the hydropathic index of amino acids may be considered. The
importance of the hydropathic amino acid index in conferring interactive biologic function
on a protein is generally understood in the art (Kyte and Doolittle, 1982, incorporated
herein by reference). It is accepted that the relative hydropathic character of the amino
acid contributes to the secondary structure of the resultant protein, which in turn defines
the interaction of the protein with other molecules, for example, enzymes, substrates,
receptors, DNA, antibodies, antigens, and the like. Each amino acid has been assigned a
hydropathic index on the basis of its hydrophobicity and charge characteristics (Kyte and
Doolittle, 1982); these are: isoleucine (+4.5); valine (+4.2); leucine (+3.8); phenylalanine
(+2.8); cysteine/cystine (+2.5); methionine (+1.9); alanine (+1.8); glycine (-0.4);
threonine (-0.7); serine (-0.8); tryptophan (-0.9); tyrosine (-1.3); proline (-1.6); histidine
(-3.2); glutamate (-3.5); glutamine (-3.5); aspartate (-3.5); asparagine (-3.5); lysine
(-3.9); and arginine (-4.5).




It is known in the art that certain amino acids may be substituted by other amino acids
having a similar hydropathic index or score and still result in a protein with similar
biological activity, i.e., still obtain a biologically functionally equivalent protein. In making
such changes, the substitution of amino acids whose hydropathic indices are within ±2 is
preferred, those which are within ±1 are particularly preferred, and those within ±0.5 are
even more particularly preferred. It is also understood in the art that the substitution of
like amino acids can be made effectively on the basis of hydrophilicity. U.S. Pat. No.
4,554,101, incorporated herein by reference, states that the greatest local average
hydrophilicity of a protein, as governed by the hydrophilicity of its adjacent amino acids,
correlates with a biological property of the protein.
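
The comparison of hydropathic indices described above can be sketched as follows. The
index values are the Kyte and Doolittle values listed in the text; the function name, the
one-letter residue codes and the rounding to one decimal (used only to avoid
floating-point artefacts at the ±0.5, ±1 and ±2 limits) are assumptions made for this
illustration.

```python
# Kyte and Doolittle hydropathic indices, keyed by one-letter residue code.
HYDROPATHIC_INDEX = {
    "I": 4.5, "V": 4.2, "L": 3.8, "F": 2.8, "C": 2.5, "M": 1.9, "A": 1.8,
    "G": -0.4, "T": -0.7, "S": -0.8, "W": -0.9, "Y": -1.3, "P": -1.6,
    "H": -3.2, "E": -3.5, "Q": -3.5, "D": -3.5, "N": -3.5, "K": -3.9,
    "R": -4.5,
}


def substitution_tier(original, substitute):
    """Classify a substitution by the difference in hydropathic index.

    Returns the preference tier named in the text, or None when the
    indices differ by more than 2.
    """
    diff = round(
        abs(HYDROPATHIC_INDEX[original] - HYDROPATHIC_INDEX[substitute]), 1
    )
    if diff <= 0.5:
        return "even more particularly preferred"
    if diff <= 1.0:
        return "particularly preferred"
    if diff <= 2.0:
        return "preferred"
    return None


print(substitution_tier("I", "V"))  # isoleucine -> valine, difference 0.3
```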

As detailed in U.S. Pat. No. 4,554,101, the following hydrophilicity values have been
assigned to amino acid residues: arginine (+3.0); lysine (+3.0); aspartate (+3.0±1);
glutamate (+3.0±1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); glycine (0);
threonine (-0.4); proline (-0.5±1); alanine (-0.5); histidine (-0.5); cysteine (-1.0);
methionine (-1.3); valine (-1.5); leucine (-1.8); isoleucine (-1.8); tyrosine (-2.3);
phenylalanine (-2.5); tryptophan (-3.4). It is understood that an amino acid can be
substituted for another having a similar hydrophilicity value and still obtain a biologically
equivalent, and in particular, an immunologically equivalent protein. In such changes, the
substitution of amino acids whose hydrophilicity values are within ±2 is preferred, those
which are within ±1 are particularly preferred, and those within ±0.5 are even more
particularly preferred.

As outlined above, amino acid substitutions are generally therefore based on the relative
similarity of the amino acid side-chain substituents, for example, their hydrophobicity,
hydrophilicity, charge, size, and the like. Exemplary substitutions which take various of the
foregoing characteristics into consideration are well known to those of skill in the art and
include: arginine and lysine; glutamate and aspartate; serine and threonine; glutamine
and asparagine; and valine, leucine and isoleucine.

Functional equivalents and variants are used interchangeably herein. In one preferred
embodiment of the invention there are also provided variants of a 4-helical cytokine, and
variants of fragments thereof. When being polypeptides, variants are determined on the
basis of their degree of identity or their homology with a predetermined amino acid
sequence, said predetermined amino acid sequence being SEQ ID No. 3 or a fragment
thereof.

Accordingly, variants preferably have at least 60% sequence identity, for example at least
65% sequence identity, such as at least 70% sequence identity, for example at least 75%
sequence identity, for example at least 80% sequence identity, such as at least 85%
sequence identity, for example at least 90% sequence identity, such as at least 91%
sequence identity, such as at least 92% sequence identity, for example at least 93%
sequence identity, such as at least 94% sequence identity, for example at least 95%
sequence identity, such as at least 96% sequence identity, for example at least 97%
sequence identity, such as at least 98% sequence identity, for example 99% sequence
identity with the predetermined sequence.

To determine the percent identity of two amino acid sequences or of two nucleic acids, the
sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced in
the sequence of a first amino acid or nucleic acid sequence for optimal alignment with a
second amino acid or nucleic acid sequence). The amino acid residues or nucleotides at
corresponding amino acid positions or nucleotide positions are then compared. When a
position in the first sequence is occupied by the same amino acid residue or nucleotide as
the corresponding position in the second sequence, then the molecules are identical at that
position. The percent identity between the two sequences is a function of the number of
identical positions shared by the sequences (i.e., % identity = # of identical positions /
total # of positions (e.g., overlapping positions) × 100). In one embodiment the two
sequences are the same length. The sequence identity is preferably calculated relative to
the full length sequence of the molecule of the present invention.
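
The percent identity calculation described above can be sketched for two sequences that
have already been aligned. The function name, the use of "-" as the gap character and the
choice of the alignment length as the total number of positions are assumptions made for
this illustration; the alignment itself is not performed here.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity of two pre-aligned sequences of equal length.

    % identity = (# of identical positions / total # of positions) x 100,
    where the total is taken as the alignment length (an assumption) and
    a gap never counts as an identical position.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("aligned sequences must not be empty")
    identical = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap
    )
    return 100.0 * identical / len(aligned_a)


# Placeholder alignments (not derived from the SEQ ID listing).
print(percent_identity("ACGT", "ACGA"))
print(percent_identity("AC-T", "ACGT"))
```

To express the identity relative to the full length sequence of a reference molecule, the
divisor would instead be the ungapped length of that reference.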

A degree of "sequence identity" of amino acid sequences is a function of the number of
identical amino acids at positions shared by the amino acid sequences. A degree of
"sequence homology" or "sequence similarity" of amino acid sequences is a function of the
number of similar, i.e. structurally related, amino acids at positions shared by the amino
acid sequences. Sequence identity is determined by the alignment algorithm that performs
global alignments which has been described by Smith TF and Waterman MS (Smith TF et
al.).

A list of the standard qualifiers and the default values for the alignment algorithm is given
below:

Standard (Mandatory) qualifiers

[-asequence] (Parameter 1)
   Description: Sequence USA
   Allowed values: Readable sequence
   Default: Required

[-bsequence] (Parameter 2)
   Description: Sequence database USA
   Allowed values: Readable sequence(s)
   Default: Required

-gapopen
   Description: The gap open penalty is the score taken away when a gap is created. The
   best value depends on the choice of comparison matrix. The default value assumes you
   are using the EBLOSUM62 matrix for protein sequences, and the EDNAFULL matrix for
   nucleotide sequences.
   Allowed values: Number from 1.000 to 100.000
   Default: 10.0 for any sequence

-gapextend
   Description: The gap extension penalty is added to the standard gap penalty for each
   base or residue in the gap. This is how long gaps are penalized. Usually you will expect
   a few long gaps rather than many short gaps, so the gap extension penalty should be
   lower than the gap penalty. An exception is where one or both sequences are single
   reads with possible sequencing errors, in which case you would expect many single base
   gaps. You can get this result by setting the gap open penalty to zero (or very low) and
   using the gap extension penalty to control gap scoring.
   Allowed values: Number from 0.100 to 10.000
   Default: 0.5 for any sequence

[-outfile] (Parameter 3)
   Description: Output alignment file name
   Allowed values: Alignment output file

Additional (Optional) qualifiers

-datafile
   Description: This is the scoring matrix file used when comparing sequences. By default
   it is the file 'EBLOSUM62' (for proteins) or the file 'EDNAFULL' (for nucleic sequences).
   These files are found in the 'data' directory of the EMBOSS installation.
   Allowed values: Comparison matrix file in EMBOSS data path
   Default: EBLOSUM62 for protein, EDNAFULL for DNA

Advanced (Unprompted) qualifiers

-[no]brief
   Description: Brief identity and similarity
   Allowed values: Boolean value Yes/No
   Default: Yes



An "unrelated" or "non-homologous" sequence shares less than 40% identity, though
preferably less than 25% identity, with one of the 4-helical cytokine sequences of the
present invention. The term "substantial identity" means that two peptide sequences,
when optimally aligned, such as by the Smith and Waterman algorithm using default gap
weights, share at least 80 percent sequence identity, preferably at least 90 percent
sequence identity, more preferably at least 95 percent sequence identity or more (e.g., 99
percent sequence identity). Preferably, residue positions which are not identical differ by
conservative amino acid substitutions.
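
The identity thresholds above can be illustrated with a minimal sketch. It assumes two sequences that have already been aligned (gaps written as "-") and it assumes that identity is counted over the columns where both sequences carry a residue; other tools use the full alignment length as the denominator. The function names and the label "intermediate" are hypothetical and are not part of the invention.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over columns where neither aligned sequence has a gap.

    Assumption: the denominator is the number of shared (ungapped) positions.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    shared = [(x, y) for x, y in zip(aligned_a, aligned_b) if x != gap and y != gap]
    if not shared:
        return 0.0
    matches = sum(1 for x, y in shared if x == y)
    return 100.0 * matches / len(shared)


def classify_identity(pct: float) -> str:
    """Map a percent identity onto the terms used in the text."""
    if pct >= 80.0:
        return "substantial identity"   # at least 80 percent
    if pct < 40.0:
        return "unrelated"              # less than 40% identity
    return "intermediate"               # hypothetical label for the range in between


if __name__ == "__main__":
    pct = percent_identity("ACDE", "ACDF")
    print(pct, classify_identity(pct))
```
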

Additionally, variants are also determined based on a predetermined number of
conservative amino acid substitutions as defined herein below. Conservative amino acid
substitution as used herein relates to the substitution of one amino acid (within a
predetermined group of amino acids) for another amino acid (within the same group),
wherein the amino acids exhibit similar or substantially similar characteristics.

Within the meaning of the term "conservative amino acid substitution" as applied herein,
one amino acid may be substituted for another within the groups of amino acids indicated
herein below:

• Amino acids having polar side chains (Asp, Glu, Lys, Arg, His, Asn, Gln, Ser, Thr, Tyr,
and Cys)

• Amino acids having non-polar side chains (Gly, Ala, Val, Leu, Ile, Phe, Trp, Pro, and
Met)

• Amino acids having aliphatic side chains (Gly, Ala, Val, Leu, Ile)

• Amino acids having cyclic side chains (Phe, Tyr, Trp, His, Pro)

• Amino acids having aromatic side chains (Phe, Tyr, Trp)

• Amino acids having acidic side chains (Asp, Glu)

• Amino acids having basic side chains (Lys, Arg, His)

• Amino acids having amide side chains (Asn, Gln)

• Amino acids having hydroxy side chains (Ser, Thr)

• Amino acids having sulphur-containing side chains (Cys, Met)

• Neutral, weakly hydrophobic amino acids (Pro, Ala, Gly, Ser, Thr)

• Hydrophilic, acidic amino acids (Gln, Asn, Glu, Asp), and

• Hydrophobic amino acids (Leu, Ile, Val)

Preferred conservative amino acid substitution groups are: valine-leucine-isoleucine,
phenylalanine-tyrosine, lysine-arginine, alanine-valine, and asparagine-glutamine.
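
As an illustration of how the groups listed above could be applied, the following sketch encodes them as sets of standard three-letter codes and reports whether a substitution stays within at least one group. The group names used as dictionary keys and the function names are hypothetical labels chosen for this example only.

```python
# Groups transcribed from the list above (standard three-letter codes).
CONSERVATIVE_GROUPS = {
    "polar": {"Asp", "Glu", "Lys", "Arg", "His", "Asn", "Gln", "Ser", "Thr", "Tyr", "Cys"},
    "non-polar": {"Gly", "Ala", "Val", "Leu", "Ile", "Phe", "Trp", "Pro", "Met"},
    "aliphatic": {"Gly", "Ala", "Val", "Leu", "Ile"},
    "cyclic": {"Phe", "Tyr", "Trp", "His", "Pro"},
    "aromatic": {"Phe", "Tyr", "Trp"},
    "acidic": {"Asp", "Glu"},
    "basic": {"Lys", "Arg", "His"},
    "amide": {"Asn", "Gln"},
    "hydroxy": {"Ser", "Thr"},
    "sulphur-containing": {"Cys", "Met"},
    "neutral weakly hydrophobic": {"Pro", "Ala", "Gly", "Ser", "Thr"},
    "hydrophilic acidic": {"Gln", "Asn", "Glu", "Asp"},
    "hydrophobic": {"Leu", "Ile", "Val"},
}


def shared_groups(original: str, replacement: str) -> list[str]:
    """Names of all groups that contain both residues, sorted alphabetically."""
    return sorted(
        name
        for name, members in CONSERVATIVE_GROUPS.items()
        if original in members and replacement in members
    )


def is_conservative(original: str, replacement: str) -> bool:
    """True if the two residues differ and share at least one group."""
    return original != replacement and bool(shared_groups(original, replacement))


if __name__ == "__main__":
    print(is_conservative("Leu", "Ile"), shared_groups("Leu", "Ile"))
```
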

Accordingly, a variant or a fragment thereof according to the invention may comprise,
within the same variant of the sequence or fragments thereof, or among different variants
of the sequence or fragments thereof, at least one substitution, such as a plurality of
substitutions introduced independently of one another.

It is clear from the above outline that the same variant or fragment thereof may comprise
more than one conservative amino acid substitution from more than one group of
conservative amino acids as defined herein above.

The addition or deletion of at least one amino acid may be an addition or deletion of from
preferably 2 to 250 amino acids, such as from 10 to 20 amino acids, for example from 20
to 30 amino acids, such as from 40 to 50 amino acids. However, additions or deletions of
more than 50 amino acids, such as additions from 50 to 100 amino acids, addition of 100
to 150 amino acids, addition of 150-250 amino acids, are also comprised within the
present invention. The deletion and/or the addition may - independently of one another -
be a deletion and/or an addition within a sequence and/or at the end of a sequence.

The polypeptide fragments according to the present invention, including any functional
equivalents thereof, may in one embodiment comprise less than 250 amino acid residues,
such as less than 240 amino acid residues, for example less than 225 amino acid residues,
such as less than 200 amino acid residues, for example less than 180 amino acid residues,
such as less than 160 amino acid residues, for example less than 150 amino acid residues,
such as less than 140 amino acid residues, for example less than 130 amino acid residues,
such as less than 120 amino acid residues, for example less than 110 amino acid residues,
such as less than 100 amino acid residues, for example less than 90 amino acid residues,
such as less than 85 amino acid residues, for example less than 80 amino acid residues,
such as less than 75 amino acid residues, for example less than 70 amino acid residues,
such as less than 65 amino acid residues, for example less than 60 amino acid residues,
such as less than 55 amino acid residues, for example less than 50 amino acid residues.

"Functional equivalency" as used in the present invention is according to one preferred
embodiment established by means of reference to the corresponding functionality of a
predetermined fragment of the sequence.

Functional equivalents or variants of a 4-helical cytokine will be understood to exhibit
amino acid sequences gradually differing from the preferred predetermined 4-helical
cytokine, as the number and scope of insertions, deletions and substitutions including
conservative substitutions increases. This difference is measured as a reduction in
homology between the preferred predetermined sequence and the fragment or functional
equivalent.

All fragments or functional equivalents of SEQ ID No. 3 are included within the scope of
this invention, regardless of the degree of homology that they show to the respective,
predetermined 4-helical cytokines disclosed herein. The reason for this is that some
regions of the 4-helical cytokines are most likely readily mutatable, or capable of being
completely deleted, without any significant effect on the binding activity of the resulting
fragment.

A functional variant obtained by substitution may well exhibit some form or degree of
native cytokine activity, and yet be less homologous, if residues containing functionally
similar amino acid side chains are substituted. Functionally similar in this respect refers to
dominant characteristics of the side chains such as hydrophobic, basic, neutral or acidic, or
the presence or absence of steric bulk. Accordingly, in one embodiment of the invention,
the degree of identity is not a principal measure of a fragment being a variant or functional
equivalent of a preferred predetermined fragment according to the present invention.

One particularly preferred method of determining the degree of functional equivalence is
by performing a biological or chemical assay such as the assays described in the appended
examples. Preferred functional equivalents of SEQ ID No 3 are those that have a KD with
respect to a predefined receptor which is less than 10 times higher than the KD of the
polypeptide of SEQ ID No 1 with respect to the same receptor, more preferably less than 5
times higher, more preferably less than 2 times higher.
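
The fold-difference criterion above can be expressed as a short calculation. The sketch below assumes that the binding constant is a dissociation constant (so a higher value means weaker binding) and that both values are measured against the same receptor in the same units; the function name and the returned labels are hypothetical.

```python
def equivalence_tier(kd_candidate: float, kd_reference: float) -> str:
    """Classify a candidate by how many times higher its constant is than the reference.

    Assumption: both values are positive dissociation constants in the same units.
    """
    if kd_candidate <= 0 or kd_reference <= 0:
        raise ValueError("binding constants must be positive")
    fold = kd_candidate / kd_reference
    if fold < 2:
        return "less than 2 times higher"
    if fold < 5:
        return "less than 5 times higher"
    if fold < 10:
        return "less than 10 times higher"
    return "outside the preferred range"


if __name__ == "__main__":
    # Hypothetical values in mol/l, chosen only to exercise the function.
    print(equivalence_tier(3e-9, 1e-9))
```
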

With respect to functional equivalence this may be defined in a biological assay based on a
cytokine dependent or stimulated cell line. Such cell lines are e.g. available from American
Type Culture Collection, P.O.Box 1549, Manassas, VA 20108 USA. The following cell lines
at least are available for testing cytokines and in particular interleukins:

Accession number | Description | Activity
CRL-1841 | TH-2 clone ABE | IL2 dependent, IL4 stimulated
CRL-2003 | TF-1 | IL3 dependent
CRL-2407 | NK-92 | IL2 dependent
CRL-2408 | NK-92MI | IL2 dependent
CRL-2409 | NK92CI | IL2 dependent
CRL-9589 | AML-193 | IL3 stimulated, GM-CSF dep.
CRL-9591 | MV-4-11 | GM-CSF dependent
TIB-214 | CTLL-2 | IL2 dependent



The following cell lines are available from DSMZ - Deutsche Sammlung von
Mikroorganismen und Zellkulturen GmbH, Mascheroder Weg 1b, D-38124 Braunschweig,
GERMANY. As can be seen from the table, some of the cell lines can be used to broadly
assess cytokine activity whereas others are only reported to respond to one or a few
specific cytokines.



Accession number | Description | Activity
ACC 211 | Mouse hybridoma, B9 | IL6 dependent
ACC 137 | Human acute myeloid leukemia, UT-7 | Constitutively cytokine responsive to various cytokines.
ACC 104 | Human acute megakaryoblastic leukemia | Respond with proliferation to: GM-CSF, IFN-alpha, IFN-beta, IFN-gamma, IL2, IL3, IL4, IL6, IL15, NGF, SCF, TNF-alpha, TPO
ACC 247 | Human acute myeloid leukemia, OCI-AML5 | G-CSF, GM-CSF, IL3, FLT3-ligand
ACC 271 | Human acute myeloid leukemia, MUTZ-2 | IL3, SCF, G-CSF, M-CSF, IFN-gamma
ACC 334 | Human erythroleukemia, TF-1 | GM-CSF, IFN-gamma, IL3, IL4, IL5, IL6, IL13, LIF, NGF, OSM, SCF, TNF-alpha, and TPO



The TF-1 cell line mentioned above can be used for assaying IL13 function. This cell line is
sensitive to various different cytokines but gives a very strong proliferative response when
exposed to IL13. This cell line can in particular be used if there is no response in the IL4
sensitive cell line (CT.h4S). Further cell lines which can be used for distinguishing between
IL4 and IL13 activity include cell lines/hybridomas such as B-9-1-3 (Bouteiller et al.) and
A201.1 (Andrews et al.).

Pharmaceutical uses of isolated polypeptides

Apart from being used for diagnosis, it is also within the scope of the present invention to
use an isolated polypeptide as defined in the invention for a pharmaceutical composition
together with a pharmaceutically acceptable carrier. Such pharmaceutical compositions
may be used for any of the purposes for which cytokines and in particular interleukins are
used at present.

Examples of such uses include the treatment of bone disorders, inflammation, for lowering
blood serum cholesterol, allergy, infection, viral infections, hematopoietic disorders,
preneoplastic lesions, immune related diseases, autoimmune related diseases, infectious
diseases, tuberculosis, cancer, viral diseases, septic shock, reconstitution of the
haematopoietic system, induction of the granulocyte system, pain, cardiac dysfunction,
CNS disorders, depression, arthritis, psoriasis, dermatitis, colitis, Crohn's disease,
diabetes, in an individual in need thereof.

It is also within the scope of the present invention to use an isolated polypeptide according
to the invention as an adjuvant or as an immune enhancer, for regulating TH2 immune
responses, and for suppressing TH1 immune responses.

A further use of an isolated polypeptide of the invention is as a growth factor for
administration to cell cultures or as a growth factor for veterinary use, e.g. for stimulating
the growth of livestock.

Immunotherapy 

Having identified a transcriptional and/or translational product of SEQ ID No 1 and/or SEQ
ID No 5 as an etiological factor in B-CLL it is also within the scope of the present invention
to perform an immunisation of a patient in need thereof against B-CLL, wherein the
immunisation generates an immune response in the patient which recognises a
translational product of SEQ ID No 2, SEQ ID No 4, SEQ ID No 6, SEQ ID No 7, SEQ ID No
8, SEQ ID No 9, SEQ ID No 10 and SEQ ID No 11. A preferred immunotherapy is a
vaccination against B-CLL by immunising an individual against a translational product of
SEQ ID No 1 and/or SEQ ID No 5. In this way the individual builds up antibodies directed
against said translational product and any developing B-CLL will be stopped by these
antibodies.

Immunisation may be performed in various ways, such as by immunising said individual
with at least one isolated polypeptide as defined in the present invention and optionally
adjuvants and carriers, or immunising with an expression construct capable of expressing
an isolated polypeptide according to the invention in the cells (DNA vaccination).

Another method comprises peptide loading of dendritic cells, or ex vivo expansion and
activation of T-cells, or inducing a CTL response that targets cells expressing the
polypeptide encoded by SEQ ID No 1 and/or SEQ ID No 5.

Antibodies

Antibodies against any of the polypeptides belonging to the novel class of proteins 
identified by the present inventors can be produced by any known method of 
immunisation. 

In one embodiment, the antibodies are produced in a non-human mammal, or in an insect. 
If antibodies are to be used for therapy in human beings they are preferably subsequently 
humanised. In one embodiment, the antibody is formulated into a single-chain antibody. 



In another embodiment, in particular for therapeutic purposes, the host organism is a
human being and the antibody is subsequently produced recombinantly in a non-human
mammal, such as a mouse. The antibody may also be produced as a monoclonal antibody
in a hybridoma. One way of producing a monoclonal antibody is described in US 5,681,729
in which a human lymphocyte producing an antibody is generated by the steps, in the
order mentioned, comprising

1. transplanting human lymphocytes to a mouse lacking both functional T and B
cells so that said human lymphocytes take in said mouse's body;

2. immunizing said mouse with a desired antigen so as to generate human
lymphocytes producing an antibody specific to said antigen;

3. administering to said mouse an antiserum to mouse cells;

4. recovering lymphocyte containing cells from said mouse;

5. separating human lymphocytes from the recovered cells by centrifugation; and

6. separating said human lymphocytes producing said antibody;

7. immortalizing said human lymphocytes;

8. cloning the obtained immortalized human-derived lymphocytes producing said
antibody; and

recovering a monoclonal antibody specific to said desired antigen from the cloned
immortalized human-derived lymphocytes.



The antibodies of the present invention may be provided as part of a pharmaceutical
composition. Such a pharmaceutical composition may be used for treating cancer,
preferably for treating leukaemia, more preferably for treating B-CLL leukaemia, more
preferably for treating poor prognosis B-CLL leukaemia.



Use of antibodies in therapy 

Antibodies directed against epitopes can be used for prevention and/or therapy of for
example B-CLL. Antigenic epitopes can be used in vaccines to stimulate an immunological
response in a mammal that is directed against cells having the B-CLL-associated epitope
found in the AMB-1 protein(s) or functional equivalents. Antibodies directed against the
antigenic epitopes of the invention can combat or prevent B-CLL.

An antigenic epitope may be administered to the mammal in an amount sufficient to
stimulate an immunological response against the antigenic epitope. The antigenic epitope
may be combined in a therapeutic composition and administered in several doses over a
period of time that optimizes the immunological response of the mammal. Such an
immunological response can be detected and monitored by observing whether antibodies
directed against the epitopes of the invention are present in the bloodstream of the
mammal.

Such antibodies can be used alone or coupled to, or combined with, therapeutically useful
agents. Antibodies can be administered to mammals suffering from any B-CLL that
displays the B-CLL-associated epitope. Such administration can provide both therapeutic
treatment, and prophylactic or preventative measures. For example, therapeutic methods
can be used to determine the spread of a B-CLL and lead to its remission.

Therapeutically useful agents include, for example, leukeran, adriamycin,
aminoglutethimide, aminopterin, azathioprine, bleomycin sulfate, busulfan, carboplatin,
carminomycin, carmustine, chlorambucil, cisplatin, cyclophosphamide, cyclosporine,
cytarabidine, cytosine arabinoside, cytoxin, dacarbazine, dactinomycin, daunomycin,
daunorubicin, doxorubicin, esperamicins, etoposide, fluorouracil, ifosfamide, interferon-α,
lomustine, melphalan, mercaptopurine, methotrexate, mitomycin C, mitotane,
mitoxantrone, procarbazine HCl, taxol, taxotere (docetaxel), teniposide, thioguanine,
thiotepa, vinblastine sulfate, vincristine sulfate and vinorelbine. Additional agents include
those disclosed in Chapter 52, Antineoplastic Agents (Paul Calabresi and Bruce A.
Chabner), and the introduction thereto, pp.1202-1263, of Goodman and Gilman's "The
Pharmacological Basis of Therapeutics", Eighth Edition, 1990, McGraw-Hill, Inc. (Health
Professions Division). Toxins can be proteins such as, for example, pokeweed anti-viral
protein, cholera toxin, pertussis toxin, ricin, gelonin, abrin, diphtheria exotoxin, or
Pseudomonas exotoxin. Toxin moieties can also be high energy-emitting radionuclides
such as cobalt-60, I-131, I-125, Y-90 and Re-186, and enzymatically active toxins of
bacterial, fungal, plant or animal origin, or fragments thereof.

Chemotherapeutic agents can be used to reduce the growth or spread of B-CLL cells and
tumors that express the AMB-1 associated epitope of the invention. Animals that can be
treated by the chemotherapeutic agents of the invention include humans, non-human
primates, cows, horses, pigs, sheep, goats, dogs, cats, rodents and the like. In all
embodiments human B-CLL antigens and human individuals are preferred.

Species-dependent antibodies can be used in therapeutic methods. Such a species-dependent
antibody has constant regions that are substantially non-immunologically
reactive with the chosen species. Such species-dependent antibody is particularly useful
for therapy because it gives rise to substantially no immunological reactions. The
species-dependent antibody can be of any of the various types of antibodies as defined above, but
preferably is mammalian, and more preferably is a humanized or human antibody.

Compositions

Therapeutically useful agents can be formulated into a composition with the antibodies of
the invention and need not be directly attached to the antibodies of the invention.
However, in some embodiments, therapeutically useful agents are attached to the
antibodies of the invention using methods available to one of skill in the art, for example,
standard coupling procedures.

Compositions may contain antibodies, antigenic epitopes or trypsin-like protease inhibitors.
Such compositions are useful for detecting the AMB-1 protein (for example antigenic
epitopes) and for therapeutic methods involving prevention and treatment of B-CLLs
associated with the presence of the AMB-1 (for example antigenic epitopes).

The antibodies (and for example antigenic epitopes and protease inhibitors) can be
formulated as pharmaceutical compositions and administered to a mammalian host, such
as a human patient, in a variety of forms adapted to the chosen route of administration.
Routes for administration include, for example, intravenous, intra-arterial, subcutaneous,
intramuscular, intraperitoneal and other routes selected by one of skill in the art.

Solutions of the antibodies (and for example antigenic epitopes and protease inhibitors)
can be prepared in water or saline, and optionally mixed with a nontoxic surfactant.
Formulations for intravenous or intra-arterial administration may include sterile aqueous
solutions that may also contain buffers, liposomes, diluents and other suitable additives.

The pharmaceutical dosage forms suitable for injection or infusion can include sterile
aqueous solutions or dispersions comprising the active ingredient that are adapted for
administration by encapsulation in liposomes. In all cases, the ultimate dosage form must
be sterile, fluid and stable under the conditions of manufacture and storage.

Sterile injectable solutions are prepared by incorporating the antibodies, antigenic epitopes
and protease inhibitors in the required amount in the appropriate solvent with various of
the other ingredients enumerated above, as required, followed by filter sterilization.

Polynucleotides 

In a still further aspect the invention relates to an isolated polynucleotide selected from
the group consisting of:

i) a polynucleotide comprising nucleotides of SEQ ID No 5,

ii) a polynucleotide encoding a polypeptide having the amino acid sequence of SEQ ID No
3,

iii) a polynucleotide, the complementary strand of which hybridises, under stringent
conditions, with a polynucleotide as defined in any of i) and ii),

iv) a polynucleotide which is degenerate to the polynucleotide of iii), and

v) the complementary strand of any such polynucleotide.

The polypeptides encoded by the polynucleotides may furthermore

a) have at least 60 % sequence identity with the amino acid sequence of SEQ ID No 3
and have interleukin or cytokine activity,

b) be recognised by an antibody, or a binding fragment thereof, which is capable of
recognising an epitope, wherein said epitope is comprised within a polypeptide having
the amino acid sequence of SEQ ID No 3; and/or

c) be competing with a polypeptide having the amino acid sequence as shown in SEQ ID
No 3 for binding to at least one predetermined binding partner such as a cytokine
receptor.

Specific examples of fragments of SEQ ID No 1 include the nucleotide sequence selected 
from the group consisting of SEQ ID No: 12, SEQ ID No: 13, SEQ ID No: 14, SEQ ID No:15, 
SEQ ID No:16, SEQ ID No:17 and SEQ ID No:18. 

Hybridisation

The entire nucleotide sequence of the coding sequence of SEQ ID No 1 and/or SEQ ID No 5
or portions thereof can be used as a probe capable of specifically hybridising to
corresponding sequences. To achieve specific hybridisation under a variety of conditions,
such probes include sequences that are unique and are preferably at least about 10
nucleotides in length, and most preferably at least about 20 nucleotides in length. Such
probes can be used to amplify corresponding sequences from a chosen organism or
individual by the well-known process of polymerase chain reaction (PCR) or other
amplification techniques. This technique can be used to isolate additional nucleotide
sequences from a desired organism or as a diagnostic assay to determine the presence of
the coding sequence in an organism or individual. Examples include hybridisation screening
of plated DNA libraries (either plaques or colonies; see e.g. Innis et al. (1990) PCR
Protocols, A Guide to Methods and Applications, eds., Academic Press).

The terms "stringent conditions" or "stringent hybridisation conditions" include reference to
conditions under which a probe will hybridise to its target sequence, to a detectably
greater degree than other sequences (e.g., at least twofold over background). Stringent
conditions are target sequence dependent and will differ depending on the structure of the
polynucleotide. By controlling the stringency of the hybridisation and/or washing
conditions, target sequences can be identified which are 100% complementary to a probe
(homologous probing).

Alternatively, stringency conditions can be adjusted to allow some mismatching in 
sequences so that lower degrees of similarity are detected (heterologous probing). 

Generally, probes for hybridisation of this type are in a range of about 1000 nucleotides in
length to about 250 nucleotides in length.

An extensive guide to the hybridisation of nucleic acids is found in Tijssen, Laboratory
Techniques in Biochemistry and Molecular Biology-Hybridization with Nucleic Acid Probes,
Part I, Chapter 2, "Overview of principles of hybridization and the strategy of nucleic acid
probe assays", Elsevier, New York (1993); and Current Protocols in Molecular Biology,
Chapter 2, Ausubel, et al., Eds., Greene Publishing and Wiley-Interscience, New York
(1995). See also Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual (2nd ed.
Cold Spring Harbor Laboratory, Cold Spring Harbor, N. Y.).

Specificity is typically the function of post-hybridisation washes, the critical factors being 
the ionic strength and temperature of the final wash solution. 

Generally, stringent wash temperature conditions are selected to be about 5°C to about
20°C lower than the melting point (Tm) for the specific sequence at a defined ionic strength
and pH. The melting point, or denaturation, of DNA occurs over a narrow temperature
range and represents the disruption of the double helix into its complementary single
strands. The process is described by the temperature of the midpoint of transition, Tm,
which is also called the melting temperature.

Formulas are available in the art for the determination of melting temperatures.
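
By way of illustration only, one widely used rule of thumb for short oligonucleotides is the Wallace rule, Tm = 2°C x (A+T) + 4°C x (G+C). The sketch below applies that rule and then derives a wash temperature window, assuming the window runs from 20°C to 5°C below Tm. The Wallace rule is swapped in here as an example and is not stated in this specification to be the formula used; it is a rough estimate that ignores salt and formamide concentration. The function names are hypothetical.

```python
def wallace_tm(sequence: str) -> float:
    """Estimate Tm (°C) of a short DNA oligonucleotide by the Wallace rule."""
    seq = sequence.upper()
    if not seq or set(seq) - set("ACGT"):
        raise ValueError("sequence must be non-empty and contain only A, C, G, T")
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2.0 * at + 4.0 * gc


def stringent_wash_range(tm: float, low_offset: float = 5.0,
                         high_offset: float = 20.0) -> tuple[float, float]:
    """Return (lowest, highest) wash temperature, assumed 20°C to 5°C below Tm."""
    return (tm - high_offset, tm - low_offset)


if __name__ == "__main__":
    # Primer FDP5 from Example 1, used here only as sample input.
    tm = wallace_tm("CCTTTATGTGTGTGACAAGTG")
    print(tm, stringent_wash_range(tm))
```
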

Preferred hybridisation conditions for the nucleotide sequence of the invention include
hybridisation at 42°C in 50% (w/v) formamide, 6X SSC, 0.5% (w/v) SDS, 100 mg/ml
salmon sperm DNA. Exemplary low stringency washing conditions include hybridization at
42°C in a solution of 2X SSC, 0.5% (w/v) SDS for 30 minutes and repeating. Exemplary
moderate stringency conditions include a wash in 2X SSC, 0.5% (w/v) SDS at 50°C for 30
minutes and repeating.

Exemplary high stringency conditions include a wash in 2X SSC, 0.5% (w/v) SDS, at 65°C
for 30 minutes and repeating. Sequences that correspond to the AMB-1 gene or fractions
thereof according to the present invention may be obtained using all the above conditions.
For purposes of defining the invention, the high stringency conditions are used.

Mutations 

Finally, the invention provides a method for determining an increased or decreased
predisposition for B-CLL comprising determining in a biological sample from an individual a
germline alteration in a target nucleic acid sequence comprising 150,000 nucleotides, said
target nucleic acid sequence comprising at least 10 nucleotides of SEQ ID No 1 and/or SEQ
ID No:5. This aspect is based on the finding of the importance of the expression product of
SEQ ID No 1 and/or SEQ ID No:5, and the complete absence of any detectable expression
product of SEQ ID No 1 and/or SEQ ID No:5 in healthy tissue and in patients with good
prognosis B-CLL. It is highly likely that the difference is caused by a germline alteration. A
germline alteration can be targeted by gene therapy methods and by the methods
provided in the present invention.

Preferably, said predisposition is a predisposition for poor prognosis of B-CLL.

Examples

Example 1: cDNA cloning 

By Differential Display (Pardee et al., 1992, Jørgensen et al., 1999) part of a gene
(hereafter referred to as AMB-1) was found that is expressed in unmutated B-CLL patients
with poor prognosis. This gene is not found in the mutated B-CLL patients. When AMB-1
was sequenced and aligned to known sequences in GenBank, perfect homology was found
to 225 base pairs (bp) of human genomic DNA from chromosome 12.

RNA was prepared using the RNeasy kit from Qiagen, as described by the manufacturer
(Qiagen, Hilden, Germany). RNA was prepared from patients with B-CLL without
hypermutation who, by PCR analysis, using primers FDP5 (CCTTTATGTGTGTGACAAGTG) and F10
(ATCCAGCCAGGATGAAATAGAA), showed a high level of the resulting PCR fragment.
Poly-A+ RNA was isolated from total RNA by the "MicroPoly(A)Purist" kit from Ambion, as
described by the manufacturer (Ambion, Inc., Texas, USA). Cloning-ready cDNA was
prepared from 8 µg poly-A+ RNA using the "ZAP Express® XR Library Construction Kit"
from Stratagene as described by the manufacturer (Stratagene, San Diego, USA). The cDNA
was size fractionated and two size fractions (fraction-1: > 2500 bp and fraction-2: 300-2500
bp) were independently ligated to pre-digested lambda Zap vectors and packed into
phage particles as described by the manufacturer (Stratagene, San Diego, USA). The titer
was determined for each library and 200,000 pfu from the fraction-1 library were plated
onto two 22x22 cm screening plates (100,000 pfu on each plate) and 750,000 pfu of the
fraction-2 library were plated on five 22x22 cm screening plates (150,000 pfu on each) as
described by Stratagene, San Diego, USA. The plates were incubated at 37°C for 18 hours
and the plaques transferred to replica nylon filters (Amersham) and denatured and
renatured to allow hybridisation. All procedures were made as described by the
manufacturers (Stratagene, San Diego, USA & Amersham Biosciences, Buckinghamshire,
UK).

The filters were screened by independent hybridisations with alpha[32P]-dATP-labelled
DNA fragments; alpha[32P]-dATP was purchased from Amersham Biosciences,
Buckinghamshire, UK. Between successive hybridisations, the old probe was removed by
incubation for 20 min in 2 l of 90-100°C water containing 0.1% SDS. The DNA fragments
used as probes were (all positions relate to sequence ID # X): 1) pos. 48978-49250; 2)
pos. 50011-51591; 3) pos. 51461-52182; 4) pos. 51901-52589; 5) pos. 53121-56521; 6)
pos. 58163-59408. All hybridisations and washes were made according to the instructions
from Stratagene, San Diego, USA and Amersham Biosciences, Buckinghamshire, UK.
Washing was done at a high stringency (0.1 x SSC at 55°C for 20 min).
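
The probe coordinates listed above imply the probe lengths and show which probes overlap. A minimal sketch of that arithmetic follows, assuming the positions are 1-based and inclusive at both ends; the specification does not state the coordinate convention, so this is an assumption, and the names used are hypothetical.

```python
# Probe number -> (start, end) positions as listed in the text.
PROBES = {
    1: (48978, 49250),
    2: (50011, 51591),
    3: (51461, 52182),
    4: (51901, 52589),
    5: (53121, 56521),
    6: (58163, 59408),
}


def probe_length(start: int, end: int) -> int:
    """Length in nucleotides, assuming inclusive 1-based coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def overlap(a: tuple[int, int], b: tuple[int, int]) -> int:
    """Number of positions shared by two inclusive intervals (0 if disjoint)."""
    lo = max(a[0], b[0])
    hi = min(a[1], b[1])
    return max(0, hi - lo + 1)


if __name__ == "__main__":
    for number, (start, end) in sorted(PROBES.items()):
        print(number, probe_length(start, end))
```

Under this convention, probes 2 and 3 and probes 3 and 4 share part of their sequence, while the remaining probes are disjoint.
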

A total of 38 plaques that showed a positive response from one or more of the screenings
were excised from the screening plates and grown as plasmids as described (Stratagene,
San Diego, USA).



WO 2004/046376



PCT/DK2003/000794 




A total of 8 cDNAs were identified by cDNA cloning or by a combination of cDNA cloning,
PCR analysis and RACE (rapid amplification of cDNA ends-polymerase chain reaction) using
the SMART RACE cDNA amplification kit (Clontech, Palo Alto, CA) according to the
manufacturer's instructions.



Example 2: Bioinformatic analysis of AMB-1. 

The DNA and protein sequence data bases (GenBank and EBI) have been searched for
sequences with similarity to AMB1. There is no significant DNA sequence similarity to any
known gene. In particular, the coding region of the AMB1 mRNA (SEQ ID no 3) is not
present in any known EST. The only significant matches to the complete mRNA sequences
and the DNA sequence of the putative coding region were BAC clones derived from the
region on human chromosome 12 where the gene is located. The "AMB-1 gene" had not
been annotated as a gene on the chromosome. Searches with the peptide sequence in the
sptrnr data base of peptide sequences (includes Sprot and nrtrembl) showed a low
similarity to putative intron maturases from chloroplasts and to bovine IL4. The percentage
similarity to both maturases and bovine IL4 was low (25.6% and 30.3%, respectively) and
the similarity to maturases only included a match to 75 amino acids of the much larger
maturases. In contrast, the match to bovine IL4 extended over the full peptide sequence.
IL4, and other 4-helical cytokines, include a leader peptide sequence (signal peptide)
allowing the proteins to be secreted. The AMB1 peptide sequence includes an N-terminal
peptide sequence with similarity to signal peptide sequences; however, it is not a typical
sequence.

A 3D search was performed, in which a peptide sequence is searched for similarity to
known protein or peptide 3D structures. The two best matches were the thioredoxin fold
and the human 4-helical cytokine IL4 (Fig. 5). The two matches had nearly identical
probability scores (2.88 and 3.05, respectively). Searches with 4-helical cytokine peptide
sequences (IL4, IL3, IL13 and GM-CSF) revealed that all could be folded into both a
4-helical cytokine structure and the thioredoxin fold. Thus, the AMB-1 peptide sequence
shares this property with 4-helical cytokines. An alignment based on the structural
similarity between IL4 (d1iara) and AMB-1 is shown in Fig. 5. The structural similarity is
not perfect (Fig. 6) and there are no obvious glycosylation sites in the AMB-1 sequence;
however, the similarity is significant. Alignment of the AMB-1 peptide sequence with the
sequences of IL4, IL3, IL13 and GM-CSF, based on their structures, showed very little
sequence conservation but a high degree of structural conservation (Fig. 7). Based on this
alignment, AMB-1 has similarities to all the 4-helical cytokines, and the length of AMB-1 and
the position of gaps in the alignment could suggest a higher similarity to e.g. IL13, but
searches at 3D-PSSM only identified a significant similarity to the structure of IL4, not
IL13, IL3 or GM-CSF. However, the search algorithms are not perfect and may therefore
fail to detect a possible low structural similarity.

Example 3: Differential expression of AMB-1



Patient material 

Blood samples were collected from newly diagnosed untreated patients with B-CLL.
Mononuclear cells were isolated by Lymphoprep separation (Nycomed Pharma, Oslo,
Norway), and the percentage of CD5+CD20+ B-CLL cells in the mononuclear fraction was
>90% in all samples as determined by flow cytometric analysis.

Isolation of RNA and conversion to cDNA. 

Material for RNA production was isolated mononuclear cells from B-CLL patients or
mononuclear cells from Lymphoprep-separated buffy coats from normal donors. Total RNA
was isolated from 5x10^ or more cells using the QIAamp RNA Blood Mini kit (Qiagen,
Valencia, CA) with DNase treatment. RNA (1 µg) was converted to cDNA by incubation with
a mixture of random primers (1 µg) and T24 primer (1 µg) for 5 minutes at 70°C. After
cooling on ice, the reaction mixture was brought to a final volume of 25 µl containing 30U of
AMV Reverse Transcriptase HC (Promega, Madison, WI, USA), 1x First Strand Buffer
(50mM Tris-HCl, pH 8.3, 50mM KCl, 10mM MgCl2, 10mM DTT, 0.5mM spermidine), 2.5mM
of each dNTP and 60U rRNasin ribonuclease inhibitor (Promega, Madison, WI, USA). The
reaction was performed for 60 minutes at 37°C.

Determination of somatic hypermutation status

Two µl of cDNA was amplified using a GeneAmp PCR System 2700 (Applied Biosystems,
Warrington, UK) with 40 pmol of a specific upstream primer corresponding to 1 of the 6
human VH family leader sequences (VH1: 5'-CCATGGACTGGACCTGGAGG-3', VH2:
5'-ATGGACATACTTTGTTCCAGC-3', VH3: 5'-CCATGGAGTTTGGGCTGAGC-3', VH4:
5'-ATGAAACACCTGTGGTTCTT-3', VH5: 5'-ATGGGGTCAACCGCGATCCT-3', VH6:
5'-ATGTCTGTCTCCTTCCTCAT-3') and 40 pmol of a downstream primer (Cµ:
5'-GAGGCTCAGCGGGAAGACCTT-3' or Cγ: 5'-GGGGAAGACCGATGGGCCCCT-3') corresponding
to a consensus sequence of the constant region of IgM or IgG, respectively. The Reverse
Transcription (RT)-PCR reaction contained 1x PCR buffer (10mM Tris-HCl, pH 9.0, 50mM
KCl, 0.1% Triton X-100), 2.5mM MgCl2, 0.2mM of each dNTP and 1.5U Taq DNA
Polymerase (Promega, Madison, WI, USA) in a final volume of 100 µl. The RT-PCR was
performed under the following conditions: 1 cycle of 94°C for 5 minutes; 30 cycles of
denaturation at 94°C for 30 sec, annealing at 62°C for 30 sec and extension at 72°C for
30 sec; and a final extension at 72°C for 7 minutes. The RT-PCR products were analysed
on 2% agarose gels and sequenced in an ABI Prism 310 Genetic Analyzer (Perkin Elmer,
Foster City, CA, USA) using the BigDye Terminator Cycle Sequencing Ready Reaction kit
(Applied Biosystems, Warrington, UK) following the manufacturer's instructions.
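
As a rough cross-check of the cycling programme described above, the following Python sketch adds up the programmed hold times. The data layout and the function name are illustrative assumptions and are not part of the described method; ramp times between temperatures are ignored.

```python
# Cycling programme for the VH leader RT-PCR, taken from the text above.
# Each step is (label, temperature in deg C, duration in seconds).
INITIAL = [("initial denaturation", 94, 5 * 60)]
CYCLE = [("denaturation", 94, 30), ("annealing", 62, 30), ("extension", 72, 30)]
FINAL = [("final extension", 72, 7 * 60)]
N_CYCLES = 30


def hold_seconds(steps):
    """Total hold time, in seconds, of a list of (label, temp, seconds) steps."""
    return sum(seconds for _label, _temp, seconds in steps)


def programmed_seconds(initial, cycle, final, n_cycles):
    """Hold time of the whole programme; ramping between steps is ignored."""
    return hold_seconds(initial) + n_cycles * hold_seconds(cycle) + hold_seconds(final)


total = programmed_seconds(INITIAL, CYCLE, FINAL, N_CYCLES)
print(f"{total / 60:.0f} min of programmed hold time")
```

The same structure can be reused for the other cycling programmes in this example by changing the step lists and the cycle count.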

Sequences obtained from each sample were compared to germ line sequences in the V BASE
sequence directory (I.M. Tomlinson, MRC Centre for Protein Engineering, Cambridge,
UK) using BLAST, and the closest germ line sequence was assigned. A gene sequence was
considered to be mutated if it had 2% or more sequence alterations when
compared to the closest published germ line sequence.
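
The 2% criterion can be illustrated with a minimal Python sketch. It assumes that the patient sequence and the closest germ line sequence have already been aligned without gaps and are of equal length, which simplifies a real BLAST alignment; the function names and the toy sequences are hypothetical and are not patient data.

```python
def percent_alteration(sample: str, germline: str) -> float:
    """Percentage of aligned positions at which sample differs from germline.

    Both sequences are assumed to be pre-aligned, gap-free and of equal length.
    """
    if len(sample) != len(germline) or not germline:
        raise ValueError("sequences must be aligned, non-empty and of equal length")
    mismatches = sum(1 for s, g in zip(sample.upper(), germline.upper()) if s != g)
    return 100.0 * mismatches / len(germline)


def is_mutated(sample: str, germline: str, threshold: float = 2.0) -> bool:
    """Apply the criterion from the text: mutated if alterations >= 2%."""
    return percent_alteration(sample, germline) >= threshold


# Hypothetical 100-nucleotide toy sequences (illustration only).
germline = "ACGT" * 25
one_change = "T" + germline[1:]    # differs at 1 of 100 positions
two_changes = "TT" + germline[2:]  # differs at 2 of 100 positions

print(is_mutated(one_change, germline), is_mutated(two_changes, germline))
```

With these toy inputs the first sequence falls below the threshold and the second one reaches it, showing that the boundary value of exactly 2% counts as mutated.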

RT-PCR that amplifies the Exon 2-Exon 3 junction
To evaluate the mRNA expression pattern of AMB-1 in unmutated and mutated B-CLL
patients, RT-PCR was performed. Exon-overlapping oligonucleotide primers were:
5'-ATCCAGCCAGGATGAAATAGAA-3' and 5'-CACTTGTCACACACATAAAGG-3'. The RT-PCR was
performed in a GeneAmp PCR System 2700 thermal cycler with an initial denaturation at
94°C for 2 minutes, 40 cycles of 96°C for 25 sec, 62°C for 25 sec and 72°C for 90 sec,
and a final extension at 72°C for 5 minutes. The reactions contained 2 µl cDNA, 1x DDRT-PCR
buffer (10mM Tris-HCl, pH 8.3, 50mM KCl, 1.8mM MgCl2, 0.1% Triton X-100, 0.005%
gelatine), 0.25mM of each dNTP, 30 pmol of each primer and 0.5U Taq DNA Polymerase
(Promega, Madison, WI, USA) in a 30 µl final volume. RT-PCR products were analyzed by
gel electrophoresis on 2% agarose gels and visualized with a Gene Genius Bio Imaging
System (Syngene, Frederick, MD) after staining with ethidium bromide. An actin control
RT-PCR was performed using the primers: 5'-TGACGGGGTCACCCACACTGTGCCCATCTA-3'
and 5'-CTAGAAGCATTTGCGGTGGACGATGGAGGG-3'.

RT-PCR that amplifies the Exon 1-Exon 3 junction:

To evaluate the mRNA expression pattern of AMB-1 in unmutated and mutated B-CLL
patients, RT-PCR was performed. Exon-overlapping oligonucleotide primers were:
5'-AGACGGCTCTCACCAATAAG-3' and 5'-CACTTGTCACACACATAAAGG-3'. The RT-PCR was
performed in a GeneAmp PCR System 2700 thermal cycler with an initial denaturation at
94°C for 2 minutes, 40 cycles of 96°C for 25 sec, 62°C for 25 sec and 72°C for 90 sec,
and a final extension at 72°C for 5 minutes. The reactions contained 2 µl cDNA, 1x DDRT-PCR
buffer (10mM Tris-HCl, pH 8.3, 50mM KCl, 1.8mM MgCl2, 0.1% Triton X-100, 0.005%
gelatine), 0.25mM of each dNTP, 30 pmol of each primer and 0.5U Taq DNA Polymerase
(Promega, Madison, WI, USA) in a 30 µl final volume. RT-PCR products were analyzed by
gel electrophoresis on 2% agarose gels and visualized with a Gene Genius Bio Imaging
System (Syngene, Frederick, MD) after staining with ethidium bromide (Continental Lab
Products, San Diego, USA).

Statistical analysis 

Statistical significance of the correlation between somatic hypermutation status and AMB-1
expression was analyzed using Fisher's exact test.
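
For illustration, a two-sided Fisher's exact test on a 2x2 table (hypermutation status by AMB-1 expression) can be sketched in plain Python as follows. This is a minimal sketch and not the software used for the analysis; the function name is hypothetical and the counts in the usage lines are toy placeholders, not patient data.

```python
from math import comb


def fisher_exact_two_sided(a: int, b: int, c: int, d: int) -> float:
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Sums the hypergeometric probabilities of all tables with the same
    margins that are no more probable than the observed table.
    """
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x: int) -> float:
        # Probability of a table whose top-left cell equals x.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_observed = prob(a)
    lowest, highest = max(0, col1 - row2), min(row1, col1)
    # The small relative tolerance guards against floating-point ties.
    return sum(
        prob(x)
        for x in range(lowest, highest + 1)
        if prob(x) <= p_observed * (1 + 1e-9)
    )


# Toy placeholder tables: rows = unmutated / mutated,
# columns = AMB-1 positive / AMB-1 negative.
p_weak = fisher_exact_two_sided(3, 1, 1, 3)
p_strong = fisher_exact_two_sided(4, 0, 0, 4)
print(round(p_weak, 4), round(p_strong, 4))
```

A table in which expression and mutation status separate perfectly gives the smaller p-value, as expected.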

Northern blotting. RNA from spleen, bone marrow and colon was purchased from
Clontech. The AMB-1 probe was an 896 base pair fragment (57661-56766) obtained by RT-PCR
as described above with the primers 5'-TCACCTGGGAGCTCAGAGGA-3' and
5'-GTGATCCTGGGAGAATCTCT-3'. For Northern blotting, 5 µg of RNA was run on a 1%
agarose gel with 6% formaldehyde dissolved in 1 x MOPS (20 mM 3-(N-morpholino)-propane-sulfonic
acid, 5 mM sodium acetate, 1 mM EDTA, pH 7.0) for size separation. The
presence of equal amounts of RNA in each lane was ensured by ethidium bromide staining.
The RNA was transferred to a Hybond-N membrane (Amersham, Little Chalfont, UK) by
capillary blotting and fixed by UV irradiation. The filters were pre-hybridized for 1-2 hours
at 42°C in 6 ml ULTRAhyb (Ambion, Austin, TX, USA) preheated to 68°C and hybridized
overnight at 42°C after addition of a further 4 ml containing the 32P-labeled probe and
sheared salmon sperm DNA (10 µg/ml). The membranes were washed for 2 x 15 min. at
42°C in 2 x SSC, 0.1% SDS followed by 1 x 15 min. in 0.2 x SSC, 0.1% SDS and 2 x 15
min. in 0.1 x SSC, 0.1% SDS at 42°C. The blot was developed and quantified by a
phosphoimager. The sizes of the mRNAs were determined by reference to 18S and 28S
ribosomal RNA, which were visualized by ethidium bromide staining. The AMB-1 probe used
for hybridization was radiolabeled with [α-32P] dCTP using the Random Primers DNA
Labeling System (Gibco BRL).

Dot blot of multiple tissue expression (MTE) array. An MTE array (Clontech, Palo
Alto, CA, USA) was hybridised to AMB-1 at 65°C in ExpressHyb (Clontech) supplemented
with sheared salmon sperm DNA (7.5 µg/ml) and human Cot-1 DNA (1.5 µg/ml) according
to the manufacturer's recommendations. The tissue types represented on the MTE array
are shown in Figure 11. Following hybridisation the filter was washed 5 x 20 min. at 65°C
in 2 x SSC (1 x SSC = 150 mM NaCl, 15 mM sodium citrate, pH 7.0), 1% SDS and 2 x 20
min. at 65°C in 0.1 x SSC, 0.5% SDS. The blot was developed and quantified by a
phosphoimager (Fuji Imager Analyzer BAS-2500, Image Reader ver. 1.4E, Image Gauge
ver. 3.01 software, Fuji, Stockholm, Sweden). The membranes were stripped by boiling in
0.5% SDS for 10 min. before rehybridization. The probe used for hybridization was
radiolabeled with [α-32P] dCTP using the Random Primers DNA Labeling System (Gibco
BRL, Rockville, MD, USA).

Results

Based on the known sequence of the AMB-1 cDNA, RT-PCRs with primers that extend
across the Exon 2-Exon 3 junction and the Exon 1-Exon 3 junction were set up. As shown
in Figure 10, where the Exon 2-Exon 3 junction has been amplified, AMB-1 is expressed in
the unmutated patients (UPN1-8) while no expression of AMB-1 is seen in mutated
patients (UPN9-16).

Northern blot analysis was performed to determine the size of the AMB-1 mRNA transcript.
As shown in Figure 3, the probe identifies transcripts in the samples from the three patients
without somatic hypermutation (UPN1, UPN4 and UPN7). However, the probe does not
recognise any transcripts from the patients with somatic hypermutation (UPN9, UPN10,
UPN13, UPN21) or from the various cell lines and tissue samples. Similar results were obtained
when cell lines and tissue samples were investigated by RT-PCR (results not shown). Dot
blot analysis on a purchased filter with 96 different RNA samples (Figure 8) only revealed
specific binding to the total DNA control dot, but not to any specific tissue (results not
shown). Thus, either AMB-1 is only expressed in B-CLL cells without hypermutation, or AMB-1 is
expressed at extremely low levels in other tissues.

We next tested the predictive value, in terms of IgVH mutational status, of expression of
AMB-1 in 29 consecutive newly diagnosed patients. At present 13 somatically unmutated
and 16 somatically mutated patients have been included in our prospective patient
database. The sensitivity and specificity for expression of AMB-1 in predicting mutational
status are well above 90% (p<0.0001), which is at the level obtained by sequencing.
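
The two performance measures can be made explicit with a short Python sketch. It assumes that an unmutated IgVH status is treated as the positive class and AMB-1 expression as the positive test result; the function name and the counts in the usage line are hypothetical placeholders and do not reproduce the patient cohort.

```python
def sensitivity_specificity(tp: int, fn: int, tn: int, fp: int):
    """Return (sensitivity, specificity) from confusion-matrix counts.

    tp -- unmutated patients with AMB-1 expression
    fn -- unmutated patients without AMB-1 expression
    tn -- mutated patients without AMB-1 expression
    fp -- mutated patients with AMB-1 expression
    """
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("each class needs at least one patient")
    return tp / (tp + fn), tn / (tn + fp)


# Placeholder counts for illustration only.
sens, spec = sensitivity_specificity(tp=9, fn=1, tn=18, fp=2)
print(f"sensitivity {sens:.0%}, specificity {spec:.0%}")
```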



Example 4. Investigation of the prognostic significance of AMB-1 in terms of patient
survival

To obtain information about the ability of AMB-1 to predict survival or time to progression
for B-CLL patients, survival curves were made. For each patient the following data were
collected: clinical stage at time of diagnosis (Rai and Binet staging), date of diagnosis, date
of first treatment and last follow-up date. Time to treatment (progression free
survival) and survival time were calculated based on these dates using the Microsoft Excel
software. Survival times and progression free survival times were plotted by the
Kaplan-Meier method and compared using the log-rank test (Prism 3.0, GraphPad software).
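
The Kaplan-Meier estimate itself can be sketched in a few lines of Python. This is a minimal illustration and not the Prism implementation; the function name and the follow-up data are hypothetical, and the log-rank comparison between groups is not included.

```python
def kaplan_meier(times, events):
    """Return [(t, S(t))] at each distinct time with at least one event.

    times  -- follow-up time per patient (any consistent unit)
    events -- True if the event (death or start of treatment) was observed,
              False if the patient was censored at that time
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        n_events = sum(1 for tt, e in data if tt == t and e)
        n_leaving = sum(1 for tt, _e in data if tt == t)
        if n_events:
            # Product-limit step: multiply by the fraction surviving time t.
            survival *= 1 - n_events / at_risk
            curve.append((t, survival))
        at_risk -= n_leaving
        i += n_leaving
    return curve


# Hypothetical follow-up times (months) and event flags, for illustration only.
curve = kaplan_meier([2, 3, 3, 5, 8], [True, True, False, True, False])
for t, s in curve:
    print(t, round(s, 2))
```

Censored patients reduce the number at risk without lowering the curve, which is why the last censored observation at 8 months produces no step.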



34 newly diagnosed, untreated B-CLL patients were investigated as described above. As
shown in Figure 2a-d, AMB-1 expression is a good predictor of B-CLL patient survival and
B-CLL patient time to progression.

Example 5: siRNA assays. 

Materials and Methods: 

Frozen cells from unmutated or mutated B-CLL patients, purified and characterized for
mutational status as previously described, were thawed and Ficoll separated to obtain the
live cells. Cells were counted and resuspended at 2x10^ cells/ml in RPMI 1640 medium
with Glutamax-1 (RPMI, GIBCO, Paisley, UK). 1x10^ cells (500 µl of cell suspension) were
incubated on ice for 10 minutes with the appropriate siRNA or combination of siRNAs at a
final concentration of 100nM each. The suspension was electroporated using a 4mm
cuvette (Molecular BioProducts, San Diego, CA, USA) in a BioRad Gene Pulser. Following
electroporation the cells were incubated on ice for 10 minutes, washed once in RPMI with
20% FCS, 100 units/ml penicillin and 100 µg/ml streptomycin (GIBCO, Paisley, UK),
resuspended at a concentration of 4x10^ cells/ml in the same medium and incubated for
48 hours. For each patient the optimal voltage allowing for maximum siRNA uptake was
determined by making a voltage curve (300-450V, 960 µF) using a FITC-labeled unspecific
probe (Xeragon, Qiagen, Hilden, Germany) at a final concentration of 200nM, and the
uptake was followed by flow cytometry.



The siRNAs were produced using the Silencer(TM) siRNA Construction Kit (Ambion, Texas,
USA) according to the manufacturer's instructions. The siRNAs had the following target
sequences:



siRNA name                 mRNA target sequence                SEQ ID

Exon 1 A                   5'-AAUAAGGGCAGGCAUCADCCA-3'         SEQ ID No: 19
Exon 1 B                   5'-AAUUACACOGCCAGGOUUCCU-3'         SEQ ID No: 20
Exon 2 A                   5'-AAUUCAUUCACAAUGAUUGCU-3'         SEQ ID No: 21
Exon 2 B                   5'-AAUDUCUCOUGGGUAAUUCAG-3'         SEQ ID No: 22
Exon 3 (DDend) A           5'-AAAADCAGAAUCUGCGCAGCA-3'         SEQ ID No: 23
Exon 3 (DDend) B           5'-AAUGAOGAUGGGAAGAAGGAA-3'         SEQ ID No: 24
CDS A                      5'-AAACUOAGUAADDGAGUGUGA-3'         SEQ ID No: 25
CDS B                      5'-AAUAUGUCACUUUCAUAAAGC-3'         SEQ ID No: 26
Transcript on -strand A    5'-AAUGAUGAUGGGAAGAAGGAA-3'
Transcript on -strand B    5'-AAACUAUGAGAUUUCAGAAGG-3'

Table 1: Overview of the siRNAs used in electroporation experiments



After 48 hours of incubation the various B-CLL samples were counted, and live and dead
cells were distinguished by nigrosin exclusion (0.1% nigrosin in PBS, from Fluka, Buchs,
Switzerland).

Results: 

Following 48 hours of incubation with control GFP siRNA or siRNAs against various regions
of the cDNAs (see Table 1), the viable cells and dead cells were counted based on the
ability of the cells to exclude nigrosin. As shown in Table 2, in 3 out of 4 patients
introduction of siRNAs against Exon 1, Exon 2 and Exon 3 resulted in an increase of dead
cells as compared to the controls (no siRNA or GFP siRNA).



Treatment        UPN 67    UPN 66    UPN 62    UPN 73

No siRNA         33.0%     19.7%     30.9%     20.2%
Control siRNA    28.4%     26.7%     24.0%     16.0%
CDS siRNA        23.7%     25.5%     21.0%     13.3%
Ex1-3 siRNA      47.3%     54.1%     17.2%     29.1%

Table 2: Percent dead cells of total cells following electroporation and incubation with
siRNAs.
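
The comparison against the controls can be reproduced from the values in Table 2 with a short Python sketch. The dictionary layout and the function name are illustrative assumptions; a patient is counted here as responding only if the dead-cell percentage exceeds both controls (no siRNA and control siRNA).

```python
# Percent dead cells from Table 2, keyed by treatment and then by patient.
DEAD_CELLS = {
    "No siRNA":      {"UPN 67": 33.0, "UPN 66": 19.7, "UPN 62": 30.9, "UPN 73": 20.2},
    "Control siRNA": {"UPN 67": 28.4, "UPN 66": 26.7, "UPN 62": 24.0, "UPN 73": 16.0},
    "CDS siRNA":     {"UPN 67": 23.7, "UPN 66": 25.5, "UPN 62": 21.0, "UPN 73": 13.3},
    "Ex1-3 siRNA":   {"UPN 67": 47.3, "UPN 66": 54.1, "UPN 62": 17.2, "UPN 73": 29.1},
}
CONTROLS = ("No siRNA", "Control siRNA")


def responders(table, treatment, controls=CONTROLS):
    """Patients whose dead-cell percentage under `treatment` exceeds every control."""
    return [
        patient
        for patient, value in table[treatment].items()
        if all(value > table[control][patient] for control in controls)
    ]


print(responders(DEAD_CELLS, "Ex1-3 siRNA"))
print(responders(DEAD_CELLS, "CDS siRNA"))
```

Under this criterion the Ex1-3 siRNA combination raises cell death in UPN 67, UPN 66 and UPN 73 but not in UPN 62, in line with the 3 out of 4 patients stated in the text, while the CDS siRNA raises it in none.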



Example 6: Identification of possible cytogenetic aberrations near or within the region
encoding AMB-1 on chromosome 12.

Rationale: The limited expression profile of AMB-1 suggests that it may be a result of a
genetic aberration (e.g. deletion, translocation or alternative splicing) or that the promoter
region controlling the expression of AMB-1 is uniquely activated in unmutated B-CLL.
Another gene is situated about 200,000 bases upstream of the AMB-1 gene (SEQ ID No 1)
on chromosome 12, and the inventors have determined that this gene is expressed at
equal levels in unmutated and mutated patients.

Methods: Using primers initially spaced about 20,000 bp apart, this region on
chromosome 12 is characterised in unmutated B-CLL patients. If genetic aberrations within
the region are detected by PCR analysis of chromosomal DNA, detailed molecular genetic
studies using FISH, microsatellite analysis and Southern blotting will be employed. The
whole region from unmutated patients is sequenced.

Example 7: Polyclonal antibodies

Production of polyclonal antibodies:

Synthetic peptides CDLETNSEINKLIIYLFSQNNRIRF and CQVSKKHIIYSTFLSKNF were
synthesized and conjugated to KLH (K.J. Ross-Petersen ApS, Holte, Denmark). Polyclonal
antibodies were produced by immunization of rabbits with these conjugated peptides by
DAKO (DAKO Cytomation A/S, Glostrup, Denmark).

Testing of polyclonal antibodies: 

At the present time we have produced polyclonal antibodies from three rabbits that have
been immunized with peptides representing predicted immunogenic regions of the protein
that can be predicted from the CDS sequence (SEQ ID No: 17) (cDNA 4). The antibodies
are tested in various ways. The proposed reading frame of the CDS (SEQ ID No: 17) is
expressed in 293 cells, and the binding of antibodies to lanes on a western blot with
non-transfected 293 cells versus transfected 293 cells is compared. The size of the band in
the lanes with transfected 293 cells is compared to the size of western blot bands in lanes
with proteins from B-CLL patients. Specificity of the bands is secured by peptide blocking
experiments.

Additionally, the polyclonal antibodies are tested in B-CLL immunoprecipitation experiments
where the antibodies are used to immunoprecipitate the protein produced from the CDS
sequence, and the immunoprecipitates are analyzed by western blotting.

Example 8: Assay for the biological activity of 4-helix cytokines.

The assay is based on the use of a cytokine dependent or stimulated cell line, for example
an IL4 dependent cell line ("Optimisation of the CT.h4S bioassay for detection of human
interleukin-4 secreted by mononuclear cells stimulated by phytohaemagglutinin or by
human leukocyte antigen mismatched mixed lymphocyte culture", Petersen, S.L., Russell,
C.A., Bendtzen, K. & Vindeløv, L.L., Immunology Letters 84 (2002) 29-39). Other
examples of cytokine dependent cell lines include IL13 dependent cell lines. A list of
commercially available cytokine dependent cell lines is disclosed in the general part of the
description. These can all be used for assessing cytokine activity. The most preferred cell
lines are those that are IL4 dependent.

The assay can be performed in two ways. The first assay comprises providing
recombinantly produced AMB-1 protein or a functional equivalent thereof and determining the
proliferation rate of the cell line. The proliferation rate (either rate of proliferation or ±
proliferation) can be compared to the proliferation rate of the cell line exposed to IL4 or
another known 4-helical cytokine or interleukin.

If a positive result is obtained with a polypeptide, an assay will be performed on the same
cell line with the IL4 receptor blocked. This will check whether the stimulus goes through
IL4R.

The second assay is based on transfecting a gene encoding a 4-helix cytokine according
to the invention into cytokine dependent cells and observing proliferation or non-proliferation
during transient expression.

Example 9: Cytokine receptor binding assays 

The following is a description of the layout of a cytokine receptor binding assay used to
determine the cytokine activity of the 4-helix cytokines according to the present invention.

The assays can be performed with any cytokine receptor. Preferred receptors include, but are
not limited to, the receptors for IL4, IL13, IL3, and GM-CSF.

The ability of recombinant cytokine receptor to bind to 4-helical cytokine is assessed in a
competitive binding ELISA assay as follows. Purified recombinant cytokine receptor (IL4,
IL13, IL3 or GM-CSF receptors) (20 µg/ml in PBS) is bound to a Costar EIA/RIA 96 well
microtiter dish (Costar Corp, Cambridge, Mass., USA) in 50 µL overnight at room
temperature. The wells are washed three times with 200 µL of PBS and the unbound sites
blocked by the addition of 1% BSA in PBS (200 µl/well) for 1 hour at room temperature.
The wells are washed as above. Biotinylated AMB-1 (1 µg/ml serially diluted in twofold
steps to 15.6 ng/mL; 50 µL) is added to each well and incubated for 2.5 hours at room
temperature. The wells are washed as above. The bound biotinylated AMB-1 is detected by
the addition of 50 µl/well of a 1:2000 dilution of streptavidin-HRP (Pierce Chemical Co.,
Rockford, Ill.) for 30 minutes at room temperature. The wells are washed as above and 50
µL of ABTS (Zymed, Calif.) added and the developing blue color monitored at 405 nm after
30 min. The ability of unlabelled 4-helical cytokine to compete with biotinylated AMB-1
is assessed by mixing varying amounts of the competing protein with a
quantity of biotinylated AMB-1 shown to be non-saturating (i.e., 70 ng/mL; 1.5 nM) and
performing the binding assays as described above. A reduction in the signal (Abs 405 nm)
expected for biotinylated 4-helical cytokine indicates a competition for binding to
immobilised cytokine receptor.
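
The twofold dilution series of biotinylated AMB-1 described above can be listed with a small Python sketch. The function name is hypothetical, and a small tolerance is assumed because the end point in the text (15.6 ng/mL) is a rounded value.

```python
def twofold_series(start_ng_per_ml: float, stop_ng_per_ml: float):
    """Concentrations from start down to (approximately) stop, halving each step."""
    series = []
    concentration = float(start_ng_per_ml)
    # 0.1% tolerance so that a rounded end point such as 15.6 still matches 15.625.
    while concentration >= stop_ng_per_ml * 0.999:
        series.append(concentration)
        concentration /= 2
    return series


# 1 µg/ml equals 1000 ng/ml.
series = twofold_series(1000, 15.6)
print(len(series), series)
```

Starting at 1 µg/ml and halving down to about 15.6 ng/mL gives seven concentrations, i.e. six dilution steps per receptor tested.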

The above identified assays can be used to identify 4-helical cytokines with similar binding
affinities as AMB-1 (SEQ ID No. 3). In the competitive binding assays biotinylated IL4,
IL13, IL3, or GM-CSF can be used to identify 4-helical cytokines which can compete with
these cytokines.

Claims

1. A method for establishing a diagnosis of a subtype of B-cell chronic lymphocytic
leukaemia (B-CLL) in an individual comprising detecting the presence or absence of at least
one expression product, wherein said at least one expression product comprises a
nucleotide sequence selected from the group consisting of SEQ ID No:12, SEQ ID
No:13, SEQ ID No:14, SEQ ID No:15, SEQ ID No:16, SEQ ID No:17 and SEQ ID No:18 in a
biological sample isolated from the individual.

2. A method for establishing the prognosis of B-CLL in an individual comprising detecting
the presence or absence of at least one expression product, wherein said at least one
expression product comprises a nucleotide sequence selected from the group consisting of
SEQ ID No:12, SEQ ID No:13, SEQ ID No:14, SEQ ID No:15, SEQ ID No:16, SEQ
ID No:17 and SEQ ID No:18 in a biological sample isolated from the individual.

3. A method for determining whether an individual has a B-CLL sub-type with poor
prognosis, the method comprising determining the level of an expression product which
comprises a nucleotide sequence selected from the group consisting of SEQ ID No:12, SEQ
ID No:13, SEQ ID No:14, SEQ ID No:15, SEQ ID No:16, SEQ ID No:17 and SEQ ID No:18
of said individual, and indicating the individual as having a B-CLL sub-type with poor
prognosis if the level of the expression product is at or beyond a discriminating value and
indicating the individual as not having a B-CLL sub-type with poor prognosis if the level of
the expression product is not at or beyond the discriminating value, the discriminating
value being a value which has been determined by measuring the level of the expression
product which comprises a nucleotide sequence selected from the group consisting of SEQ
ID No:12, SEQ ID No:13, SEQ ID No:14, SEQ ID No:15, SEQ ID No:16, SEQ ID No:17 and
SEQ ID No:18 in both a healthy control population and a population with known B-CLL
sub-type with poor prognosis, thereby determining said discriminating value which
identifies the B-CLL sub-type population having a poor prognosis.

4. The method according to claim 3, wherein the individual is a member of an unselected
population.

5. The method according to claim 3, wherein the individual is a member of a population
already identified as having a B-CLL sub-type with a poor prognosis.

6. The method according to any of claims 3-5, wherein the determination is performed at
several time points at intervals as part of a monitoring of a cancer patient after or during
the treatment for primary cancer.

7. The method according to any of the preceding claims, wherein the expression product is
a transcriptional product.

8. The method according to claim 1, wherein the at least one transcriptional product is
selected from the group consisting of SEQ ID No 2, SEQ ID No 4, SEQ ID No 6, SEQ ID No
7, SEQ ID No 8, SEQ ID No 9, SEQ ID No 10 and SEQ ID No 11.

9. The method according to claim 7, wherein said at least one transcriptional product
comprises a nucleotide sequence selected from the group consisting of SEQ ID
No:15.

10. The method according to claim 7, wherein said at least one transcriptional product
comprises a nucleotide sequence selected from the group consisting of SEQ ID
No:16.

11. The method according to claim 7, wherein said at least one transcriptional product
comprises a nucleotide sequence spanning the junction between Exon-2 and Exon-3.

12. The method according to claim 11, wherein the nucleotide sequence spanning the
junction between Exon-2 and Exon-3 is the last 20 nucleotides of the 3'-end of SEQ ID
No:15 and the first 20 nucleotides of the 5'-end of SEQ ID No:16.

13. The method according to any one of the preceding claims, wherein the presence of at
least one of the transcriptional product(s) indicates that the individual has a subtype of
B-CLL associated with a poor prognosis.

14. The method according to any one of the preceding claims, wherein the presence or
absence of the transcriptional product(s) is/are determined by a method selected from the
group consisting of a nucleic acid hybridisation based technique and a PCR based
technique.

15. The method according to any one of the preceding claims, wherein the biological
sample is selected from the group comprising blood, serum, plasma, urine, saliva, lymph
node biopsy, bone marrow, spinal liquid, spleen biopsy, and liver biopsy.

16. Use of a compound capable of eliminating transcription of at least one expression
product comprising a nucleotide sequence selected from the group consisting of SEQ ID
No:12, SEQ ID No:13, SEQ ID No:14, SEQ ID No:15, SEQ ID No:16, SEQ ID No:17
and SEQ ID No:18 for the treatment of B-CLL.

17. Use of a compound capable of eliminating transcription of at least one transcriptional
product comprising a nucleotide sequence selected from the group consisting of SEQ ID
No:12, SEQ ID No:13, SEQ ID No:14, SEQ ID No:15, SEQ ID No:16, SEQ ID No:17
and SEQ ID No:18 for manufacture of a medicament for the treatment of B-CLL.

18. A method of treating B-CLL comprising administering to an individual in need thereof a
compound capable of eliminating at least one type of transcriptional product, said
transcriptional product comprising at least one nucleotide sequence selected from the
group consisting of SEQ ID No:12, SEQ ID No:13, SEQ ID No:14, SEQ ID No:15,
SEQ ID No:16, SEQ ID No:17 and SEQ ID No:18.

19. The method according to claim 18, wherein all types of the transcriptional product(s)
are eliminated.

20. The method according to any one of claims 18 or 19, wherein the at least one
transcriptional product is selected from the group consisting of SEQ ID No 2, SEQ ID No 4,
SEQ ID No 6, SEQ ID No 7, SEQ ID No 8, SEQ ID No 9, SEQ ID No 10 and SEQ ID No 11.

21. The method according to any one of claims 18-20, wherein the elimination of the at
least one transcriptional product is achieved by inhibiting the formation of the
transcriptional product.

22. The method according to any one of claims 18-20, wherein the elimination of the at
least one transcriptional product is achieved by destroying the transcriptional product.

23. The method according to claim 21, wherein the compound is a nucleotide capable of
inhibiting the transcription of a nucleic acid sequence encoding any transcriptional product
comprising a nucleotide sequence selected from the group consisting of
SEQ ID No:12, SEQ ID No:13, SEQ ID No:14, SEQ ID No:15, SEQ ID No:16, SEQ ID No:17
and SEQ ID No:18.

24. The method according to claim 22, wherein the compound is a nucleotide capable of
destroying any transcriptional product comprising a nucleotide sequence selected from the
group consisting of SEQ ID No:12, SEQ ID No:13, SEQ ID No:14, SEQ ID
No:15, SEQ ID No:16, SEQ ID No:17 and SEQ ID No:18.

25. The method according to claim 24, wherein the nucleotide is an siRNA.

26. An isolated polypeptide comprising an amino acid sequence selected from the group
consisting of:

i) an amino acid sequence of SEQ ID NO: 3,

ii) an amino acid sequence having at least 60% sequence identity compared to
the full length sequence of SEQ ID NO: 3,

iii) a fragment of SEQ ID NO: 3 having at least 60% sequence identity compared
to the full length sequence of SEQ ID NO: 3.

27. An isolated polypeptide according to claim 26, said polypeptide having interleukin or
cytokine activity.

28. The isolated polypeptide according to any one of claims 26 or 27, which folds as a
4-helical cytokine.

29. Use of an isolated polypeptide according to any of claims 26-28 in a diagnostic method
for a subtype of B-CLL having a poor prognosis.

30. The use according to claim 29, wherein the diagnostic method is based on
immunological assays, such as FACS analysis, western blotting, RIA, ELISA,
immunohistochemistry etc.

31. Use of an isolated polypeptide as defined in any of the claims 26-28 for the treatment
of cancer.

32. Use of an isolated polypeptide as defined in any of the claims 26-28 for the preparation
of a medicament for the treatment of cancer.

33. Use according to claims 31 or 32, wherein the cancer is B-CLL.

34. A method of immunisation of a patient in need thereof against B-CLL, wherein said
immunisation generates an immune response in said patient which recognises a
translational product of SEQ ID No 2, SEQ ID No 4, SEQ ID No 6, SEQ ID No 7, SEQ ID No
8, SEQ ID No 9, SEQ ID No 10 and SEQ ID No 11.

35. A method for producing an antibody with specificity against an isolated polypeptide as
defined in any of the claims 26 to 28, said method comprising the steps of

i) providing a host organism,

ii) immunising said host organism with an isolated polypeptide as defined in any of
the claims 26 to 28, or transfecting said host organism with an expression vector
capable of directing the expression of an isolated polypeptide as defined in any of
the claims 26 to 28,

iii) obtaining said antibody.

36. An antibody obtainable by the method of claim 35.



37. An isolated polynucleotide selected from the group consisting of:

i) a polynucleotide comprising SEQ ID No 5,

ii) a polynucleotide encoding a polypeptide having the amino acid sequence of SEQ
ID No 3,

iii) a polynucleotide, the complementary strand of which hybridises, under stringent
conditions, with a polynucleotide as defined in any of i) and ii), and encodes a
polypeptide, which has at least 60% sequence identity with the amino acid
sequence of SEQ ID No 3,

iv) a polynucleotide which is degenerate to the polynucleotide of iii), and

v) the complementary strand of any such polynucleotide.

38. The isolated polynucleotide according to claim 37, comprising the nucleotide sequence
selected from the group consisting of SEQ ID No:2, SEQ ID No:4, SEQ ID No:6, SEQ ID
No:7, SEQ ID No:8, SEQ ID No:9, SEQ ID No:10 and SEQ ID No:11.

39. A diagnostic kit for ex vivo or in situ diagnosis of a subtype of B-cell chronic 
lymphocytic leukaemia (B-CLL) in an individual, the kit comprising a detector molecule 
capable of detecting the presence or absence of at least one expression product, wherein 
said at least one expression product comprises a nucleotide sequence selected from the 
group consisting of SEQ ID No:12, SEQ ID No:13, SEQ ID No:14, SEQ ID No:15, 
SEQ ID No:16, SEQ ID No:17 and SEQ ID No:18 in a biological sample isolated from the 
individual. 

40. A kit according to claim 39, wherein the detector molecule is a nucleotide. 

41. A kit according to claim 40, wherein the nucleotide is capable of hybridising to a 
nucleotide sequence selected from the group consisting of SEQ ID No:12, SEQ ID 
No:13, SEQ ID No:14, SEQ ID No:15, SEQ ID No:16, SEQ ID No:17 and SEQ ID No:18 
under stringent conditions. 
SEQUENCE LISTING 



<110> Rigshospitalet 
Henrik Leffers 
Anne Mette Buhl Hertz 
Jørgen Kjems 

<120> Methods and kits for diagnosing and treating B-cell Chronic lymphocytic leukemia (B-CLL) 



<130> P34546PC01 

<150> DK/PA 200201792 
<151> 2002-11-19 

<160> 43 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 19999 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> gene 

<222> 40000-60000 

<223> Sequence of ac063949.emhum 

<400> 1 

cacacgtagg ctacgagtgg ccctcagcct gcctcatcat ggacctgtgt tataataaat 60 
atgtttaatt gtgctgtttt cttatagagg aaagtcctga tgttagttgc cttgaagtca 120 
gacacccaga gagaatcaca ggttttcaga ttaattcatc gcttgattct tatccctgaa 180 
gtcatatctc tggatctctg gttctcacat tataaatttc aatgattctt tttctatatg 240 
gccatgtcat tcatatcctg tgtaatatgg ggaaactgag gtatgaatga catcattcaa 300 
aaagcacctg caatttttct ttgccaagca cttacagctt tttctcatgt tgctttcaaa 360 
aagtcattga aatattgttc acatattttg cagatgagga aatgaatatt caaatgcatt 420 
aggtatcttg tccaagttct tacagccaga aagtagagaa atgaatttga attacaaatc 480 
ttctacctct tggcttatgc tcttttcatg acactgggaa taaatgtctg aacaagcatg 540 
acttcatgtt tcaactattt atcaaatact tgttttctac taagatcttg cactcactca 600 
gtgggatccc ctgaagcctg ctgattattt gtcctttggc atttatcact ctctgtggga 660 
ccttactctc ctatggtaaa gttttattgt tattaaaagt attatttgac aataaatgta 720 
gaaatcctac agatcatact caacaacatg tctaatgtca gcacacaatg tctaacaatc 780 
atttatgaat actttatgtc aaacataagc aataacctaa ttaaggaagg tatttttaat 840 
aaattgacac tttttgacat aaccatattt caagtggctc cattgttttg tttatttatt 900 
tatttattta tttatttatt tatttttgag aaagggtctc actctgttgc ccagactgga 960 
gtgcagtggc aacatcatag ctcactacag cctcgacctc tctgggctca agcaatcctc 1020 
ccatctcagc ttcccaagta gctgggacta caggtgtgta ccatcatgcc aggctaattt 1080 
ttcgtatttt gtagagacgg ggttttgcct ggtcgtccgg gttggtctca aactcctggg 1140 
tgttccgccc accttggcct cccaaagtgc tgggattata ggcatgagcc tcaagtggct 1200 
actttttagg gttgaaattt atattgactg tcaactagct tccctagtta gtatttggga 1260 
tctgctaact aatttatatt accatccaac ttgtcaacat ttgttgaaat ataactgtcc 1320 
tcactttttt tgtgtgaaca ttgaatacac tttcagacta aatttggttt attacttaat 1380 
gtcttattct ttattagagt taataatatt tcttaatact ttgccttcca caaatgaata 1440 
acttgtttgt gatggctacc tctttttttc tcttagcctg tcacaggtat tatggataaa 1500 
aattagcacg gctgggcaaa aacaatgaaa gaaatacact tgcctgggaa agctggggag 1560 
gggtaaatga atataattca aaataccata tatttattca acactgttgg aatatatgtc 1620 
ctgttggaaa tgtaaaagtg acatatgttc tcttcctggg tctcagactt ttaggatcta 1680 
gttgagggaa ctggacttat acacaaaata caattcaaca acattatgag ctagaaaatc 1740 
catgagctaa agtctttggc aaagacatta ggtaacatga ggagtcagga aaaggagaaa 1800 
ttactgtggg ctggaatggt ctgggaacat gagatggagg aagtggcttg ttactggaga 1860 
aaggatgagg ttcaaagaga tgggaaaaaa agaaagagag aagaaagaaa agaaatgagg 1920 
aaaaacaagt tgccagaaag aacaaggaag aatagaggca ggtaagcagt ggattttgcc 1980 
ctagggaagg taatataact agagacggca gtttctaaca ggccatgatg aataagatac 2040 
actttagccc tcattggtac gtgcagaaat tcaaatttgg aaattcaagc ttacatgaca 2100 
gtaaatatat gttgggaaaa aaataaccgg taaacattta catcagctct ttttcctaaa 2160 
gagaaaccta ttccatgcta tgaaatattt gtcacaattc tgttttcaaa atacttgctc 2220 
tacttttcca agccacaaga ggaaacattt tctctgccaa cactctctga ccttaaccag 2280 
tttctccact acgtctactc ttaagctctc tttagagctg tgtgtatctc gtctttatgt 2340 
aaacctccta gatgatatac ttatggaaat attcaggcaa ctttttcatg aactttacca 2400 
ggaaagacat ttctagcagg agagcatgaa tagaaatgga ctcttcccca gtctctgctg 2460 
ggttctgact gtggtcactc taactataaa aagtgtgtaa aaatcatgag cagattattt 2520 
catttccttg gggtccctaa aaatttcaag gtatctgtat tagcacagga agatttaaat 2580 
tgatttctca acacattcag atatcttatg aactttatta agataaattt cctccagcat 2640 
tcagaaactc atatattaca gaataaaaaa taaagcagaa aattagtgta cctggctaaa 2700 
aatgagagca gggttctatt tcattttgga aagtcactaa gacagtaata ataccattaa 2760 
tgataaaatg ttaacattag ttaattatta gatgtgtttt tgtatgccag ccacataata 2820 
tatactttta tatgtatcac ctacatttct agatgtaaat gtgagggaat tatagtagta 2880 
tctacctcgt atgattgctg gatgatttaa atgagctgtt gtctcaaaaa cttggtatag 2940 
aaagcagaaa cttttagtta ttaagattct tactattcca atatttgaat aaaacagtga 3000 
cctgctaaga aaccccaata atattctgat acatcaaaac cttctggcat tagatgtttc 3060 
taatctaaca tcttcatatt aattttttta tgttttgatt atctacattc agtagtgaat 3120 
gtgtttctaa acgctggatg catttttaac taaatgtgtt ttgtaccaca ttttgacaac 3180 
ttttgtttta actatgattc agcttataac aaaacaaaac aatgcatctt ctctccactg 3240 
ttaataaggt taatgaaaag ttgacttatg aaaaaaatcc taatttatgc acattctcat 3300 
tgttttcctt gctaaggata ttagtacttg acgattctgt aacaaagaat tatcatggga 3360 
tgaaactttg atgcaaatat cttatcaata caatgtgctt gattttacct agatgagatt 3420 
tttcttttct tctttctttt ttgagacagg gttttgctgt gttacccagg ctagcctcaa 3480 
acccctggcc cctggcctca agtgatcctc tcacctctgc ctcccaaagt gctgggtatt 3540 
acagatgtga gtcactgaat ccagcctcac ttagttggct ttcttagtga attattttat 3600 
ctggttctaa aactttttga taatactctc aaatatttat ggattttata acataattta 3660 
tggattacgt agttatgaat ttcataaatg attttgtgat attgccacag atcatcacca 3720 
ttatacagga tgtataacat aaccatggtt taatatattt tcataaacta tagaccaaac 3780 
aaagactggt caggaccagg gcacgcatgc attttatatg tgtggtgcct attggaatat 3840 
gccaggcctc ctgtgaaaaa aatcagtaag tgcttatctc ataggaccaa cggcccaaca 3900 
ttcctgaagt cactaccaca ctttgcactt atctccatgt ggaaatagat agccactgtt 3960 
gaattctggt gagaacgaca cgtctgaaat ctctcagctt cacaacccct attacagccc 4020 
tcagagaatc ttctcacata gcgccaaaca acaactttag gaagtgatgt tcctagaatg 4080 
aatcaatttc taaaattaaa agtgaaaaca atgacaagga gaagggaggg tcagagagga 4140 
aaggctgatg ttactaaaag acaaaagaca gtataacctc ttatgaggat ggtccagaca 4200 
ctcagggaaa tgcaggaaga aataaaagat aggagtttga accacactgt gatggctaac 4260 
tttatgtgtg gacccgaccg atctatggga cacccagata gcttgtaaag cactatttct 4320 
gggtgcgtca gtgagggtgt ttttggaaga gatcaacact tgagtcagta gactgagtaa 4380 
agcagatggt cctcaccaat gtgggtgcac attgtttaat ctgttgagtg cctggataga 4440 
caaaaaaggc aaaagaaggg tgaattccct ttctcgtctt aagctgggac agccatcctt 4500 
tcctaccctc agacatgaga ggtttggatt cttggatctt tggtcccaag ggctgacact 4560 
ggtggccacc tctggtttca ggtctttggc cccagattgt aagttacacc atcagcttcc 4620 
ttggttcttg ggccttgaga ctcaagctaa aatacactac cagcttccct tgttctctag 4680 
tgtagggaca gcaaatcatg aaaccttctg cctccataat catataagtc aattcccgta 4740 
ataaatctgt gcttatatat ctatagcttt ccttttggtc tgtttctcca aagaacctta 4800 
atgtacacac tatatgacct aacctgtagt aatgataacc ttatgcaggt ttgaataaga 4860 
tgatggtatt ctcagtatct gggaggtatg ggctagagtg atgaaccacc gccatgagcc 4920 
taggactgag gagatttctg aaatgtggaa tatttggtgt caaaaccaag agataatata 4980 
gccatgtgga aaacatgtag aactatcgta tgattcagca acccaaccac tgggaattta 5040 
cccaaaggaa aggaaaccag tatattaaag agaatctgca ctcccatggt tattgcagca 5100 
ttattctcaa tagcctagat atggactcaa cctaggagat tagatgaatg gacaaagaaa 5160 
atgtggcata tgtacaccat gaaatactta ccagctataa aaaagaatta gccaaagcag 5220 
tggtgtgtgc ctgtcatccc agctccttgg gaggctgagg tgggaagatc tcgaggccag 5280 
aagtttgaga ccagcctggg caaaataata agactcggtc tctaaaacaa tttaaaaata 5340 
ggccttcctt aaaaaaagaa taaaatcatg tcattcacgg caacatagat gggactggag 5400 
gatattactg ttaagtgaaa ttagccagga acagcaagtt aaaccccaca tattctgatt 5460 
catatgcgga agctaaaaaa acgttgatct catagaagta aaaagtagaa cagaggatgc 5520 
tggagactag aaaaggtagg gagaaggaag ggagagggaa aaatttgtta acaggtacaa 5580 
aaacaaaatt acagttagtt agggagaatt aattccagca tcctgtagca ctataggatg 5640 
actatagtta ataataatac tttaattagt ctcaaatagc tagaaggagg atattgaatg 5700 
ttcccaacac acacaaaaaa atgataatgt atgagatgat ggatatggta gttatcctga 5760 
tctgatcact ctacattata tgtatcaaca catcactatg taccccacaa atatgtagaa 5820 
tttttatttg tccatttaaa aaagataaca aatttaaaaa taaaataaaa actaaattag 5880 
tgttccatgt aaacctggat gaactggtca ccctacgtct gcccatctag atggctggtc 5940 
aaagtttccc aggctccaca tcaagttgtt ccactgctca ctggaacttc cctagtcagg 6000 
ttgggcaaat agtaatttac agcaatagtg aatttatcac tgacatttct tcagttcccc 6060 
tctttggcat ctgcttcttc ttttctgtaa tgctgtttgt tgaaatgccc aacattcttt 6120 
ttcttcccta gagctattca gggtgacctt tcttttcgca ttttcccatg ccacttccat 6180 
tatatcaaaa taaaacagtc ctgtgtggcc actgctcatg accttgtttc ctgccatgtg 6240 
aagataggat cggctgctct ttcttctcct cctttttttt cagagacagg atctctccct 6300 
gtcacccaga ctggagtgca atggcacagt cgtagcttgc tgcagcctcg aactcctgga 6360 
cctcctcagc ctcctgagta gctgggacta caggtgcaca ccaccatgcc ttcctaatct 6420 
gatatatata tatatatata ttatatatat aaaatatata tataaatata tatattttat 6480 
atattatata tataatatat atattttata tataaaatat atatattata tatatatatt 6540 
atatataaaa tatatatatt atatatatat atatatatta tagagatggg gtcttgctct 6600 
gtcacccagg ctgaagatca gctgctcttt ctaatctgtg gttagataag atctgtctcc 6660 
caggggataa aatactacct ggaataaagg tatctttaaa ataatcccag agaagaaaac 6720 
atttttatag tatgacagag gcagagaaaa cagagaatat ttgttaaggc aggactttca 6780 
ccactcccag tacaatcatc tgtctgttac ctgcatacct tacacgggct ggcactgctg 6840 
ggggtacaaa gtagatgcca aacttcacaa tggttagatt catgtttaaa aagccattgg 6900 
atcaaacctt tgtgaaagtt tccagctttt ttctgttcca aatatgtgtc cattataaaa 6960 
gaatctcaag agcataattg ccaagatagt ctatgtccat gagtatttca acatctctca 7020 
tgaaatctgt tcccatcatt actcaagata ttgtatgaac agtattccac ataaactagg 7080 
tgctcaataa tgattgattg gccaatggag ggtcattatt taatgcacta caatctttta 7140 
tgcaaggggc ccacaggaat cagtatgatc ccataggaat ccttttcttt tccattgaaa 7200 
aagaaacaga tagtggcttg tattaggttt cttgtgtgtg ttgtgaggtg gaaagatatg 7260 
aaaagaaatt tgatcagagc ataaatctga gcccatggga taggaaagaa tgagggaata 7320 
aggaagaaaa cacagattat agacaggaaa atcaaaccta ttaaaactga taattttcga 7380 
atactaaaaa tgtacattca tttgaacaaa aagattctat aaagcaagat ttctctgttc 7440 
ttaccagcac taccatgccc aaactacctt aggaaatgaa tagcagagtc aaacttaaaa 7500 
gcacctgaaa tttaaaacaa aaaccaattt acattttatt taagaaaagc aaacagatgg 7560 
gcctgctaac aatgtcaaag tctcgtttac aaagaaaaaa acaaatctgg aacctgaagt 7620 
caaacgagtt caaaataaaa agcaaaccaa taaacagaaa ccaacataaa cagaagttac 7680 
taccatctcc ctcagcctgt gaaattctgg aacttctctt tctttctcgc cttcttcttc 7740 
tctcacctgg aagacgagca gagtgaacac atcaggggtt gtcagttccc cagatggcac 7800 
cacattcata aaccaccgac tccaggagaa tgtaggaagc ttagttaagg ccaaagttct 7860 
ctttggatct tcctcatggg cttcaaggca aaagaaaaaa aagtttgctt gagaatatct 7920 
tcatatctat tagtttgaac catgcaaaat tacagttttt ataggtaaaa tgagtgcata 7980 
ttggcaattt caaatgatta accctaatac attatgcttt tgggtataga aatattcaga 8040 
tcttaaacat atgctgttac atacaaaatc aggtatattc ctgcttctat aattaaagca 8100 
aagagaattt cttttggtca ctactccttc tgacatgagg tatgaaccaa gttcaggacc 8160 
cctaaaggtc tgggtctggg tcatttctcc acctctaact tgtgccgctt tcttggtcag 8220 
tcattgtgtt ctgagctgtc tcataaaaca tctgctatga ctttactttc tcctgatagg 8280 
gtggctttcc atcgttggca cttcgttggc cttattggta tgctttatac actggttctc 8340 
gtttccaaat tggcattatt attgttatga ttcctgctgc tctcccacat ttcccatctt 8400 
tctcctgatc tctctcacct gtacatttct tacattttct cctgtgcttc cttcttccca 8460 
tcatcattgc ccaagtgtgt cttctttctt ctccttgtca catttccttt gcccgctctc 8520 
acatatgcag agatggctct tggttttcct tctgaaatct catagtttgg aggtaaactt 8580 
gttagcaagg ccactgagaa gagaacaaaa gggaaacata agagaaacca agtcactatc 8640 
tctctcattt cctggtttct agaagtaaga cccaaagaac tcactgtttc agtgctttca 8700 
gctcaggcca aactagggtg atcaaactga gcttctgagt gctgatcaaa acctataaaa 8760 
ccaagtagac agaccatcta caaatcttca ctgttaaata ccataaagaa tgaaaaggtc 8820 
actaattggt aagactatat gtgtgataat taaatttatg catcaacctg gctaggctaa 8880 
aggatgacca ggtagctggt aaaacattat tctgggtgtg tccataagag tgttttcgga 8940 
agagatcagc atttgaattg gtgaacttag taaagcagac ggctctcacc aataagggca 9000 
ggcatcatcc aatctgtcga aagcttgaat aaaacaaaaa gaggaaggga aaatttgctt 9060 
cttttcttct tgatctagta tatcatcttc tcctgccctt ggatgtgagt gggccttcag 9120 
acttaaacca ggagttacac ctttggcttc cctggttctc agttctttgg acttggactg 9180 
aattacactg ccaggtttcc tggttctcca gcttgcagat ggcagatcat gggacttctt 9240 
ggcctccata attgtgtgag tcaatttcca ttttatttac atatccagtt atgcattgct 9300 
taacaatgga gacaggttct gagaaatgca ttgttaagtg atttcatcat tgtgcaaaca 9360 
tcatagagtg taactacaca aacctggaca gcatagacta ctacacatct aggctacatg 9420 
gtgtagcttg taacctcatg ataagtatgt ataacatcat gataagtatg tatgtatcta 9480 
ccatatctaa atgtagaaaa ggtacagtaa aaatatggta taatcttatg ggatcaccat 9540 
catatatgca atcctttgta gactgaaatg tcattgtgta gtgcatgact gtatacgcac 9600 
acatacacaa acacacacaa atatactatt ggttcttttt ctctgaagag ccctaataca 9660 
atatgttata catttatatt gactctattt caaaatttat ggttttggtg aaacatatgt 9720 
ggagatgggg cataggtgtg tgaactggga tagtgtcctg ctgatgaatg ggtgggaggc 9780 
atcatttggg acaagcccag ggcatcagct tatagatatc aagagctcaa caagagcact 9840 
ttatggcaaa acctcccaca agacctctca gaagttgaga aactgctaaa agtttcttta 9900 
tgacagatga catttatgga taaaataggg attagcagga ttctttaaat actttcgaac 9960 
actaaccttc atttctacca ggcagtgggg ccccaagtgc agggccatag gaagtacaag 10020 
tctgggagat actaggctgc actgtctgta gagaatctga aaaaataata gagtcactga 10080 
aatgcagttt ggtataatta ttgccatgca tcataattct aaatcatact agtggtcaaa 10140 
tactcttccc tgaaaaaaca ttttcttggt ttgaattcta aataattgtt gtggtcacca 10200 
ctgagctttt aaatatataa atactttcaa gtttgcatat ttttattacc tgttccttaa 10260 
caaacattga attcaacatg aaaatgatta tgggaaacat tcgggtatac agtccctgac 10320 
tcttaaggac tcaggtaaat acttagggta tttcatggcc ctagtctttg gggtaccaca 10380 
tgtttcttct tcaaatcaca gattcaaaat caagaatgat aacacagtga ttgtgtagac 10440 
aaaataagtg aaccaaaatt gcttgcttct gtcattctat ggaaccactg agagttttta 10500 
cttgtgctta aaattttgaa tagtaaaaca gagtgtcaac ttcatgctgg aatatttttg 10560 
gctttttaga cacaatttta agtacatgaa gtatttttac aagactaagt aacatcactg 10620 
aaattacagc tttcttcttt ttaaaactgg tatttgttat aaaactaaag agcgaatcaa 10680 
gaaaagcata attattactg attattacag gattattact gaaaaagaaa tgtacggaat 10740 
agaggaggaa ggagttaaca aatgatccac tctgggtgtt gaaaacacca ataagcctgc 10800 
ttccaggaag tgcctaagac agagctggct cagcttgctg ggtcacagca tgtaaggaaa 10860 
ctgctgggct acatgccacc atcctcagtt gtccagatag ataatcccat agccccatgg 10920 
ggaaataatc tttaattatg atatagctga caccattcaa agcactatgc taagtccttt 10980 
atgtgaatta acttttgtca aatttatttt tcataaataa cccaaatatg tataccacta 11040 
ttatcctacc ttaaagagga gaaactgagc tcctaaagtt taaatatcta acccaagtta 11100 
agactgctag tcaccctagg ctattaactc aggcagtcta actcaggtat aataacatta 11160 
tgctactgtt tgcagctttg actatgcctg aattataacg tcatgctatc taactaaaaa 11220 
gctaagggaa ataaaatgag ccatagggct caatttcata aaaggagaga aaatactggg 11280 
gaaaagtgat aatgcagagt ttaaaatatt tttgtaaaag tgccagagat tgagtataac 11340 
aagtgtgacc aaaaaaaaaa aaaaaaaaaa aaaaggaaga aggtaaaaaa aagagggagg 11400 
tctgagaaat agaaatatca gaggaaggaa ataaaggagg gtgagagtaa attctctttt 11460 
agcattcaga ttccacagat tccacaaatc acatttcttt ttttaccaac taaggaaaaa 11520 
taacacttga cctaacattt cattgcagtt agctaaagga tgctagaaaa actatgttgc 11580 
agtggtttgc tctaatttct tcaggaatag agaaaagtga caaaaagatc agagaagaga 11640 
agaaaggaaa ctatcagaaa aatacagaat tggagtagga tataacatat ttgggttgaa 11700 
ggtaaaattt tatattgtaa tcttaagtat cttgctactt cagtttggtc cctggaacag 11760 
cagcatcaga atctgccgag ggcttgttaa aaaggcagaa tctcaggtcc catcccagac 11820 
tcactgaatc agaatataaa tactgacaag atgccccggg attcatatgc acagtagagc 11880 
tggcgaagtt ccattgtagc ctgtgattgt tttctgcaac ttagtatttc tgagttttcc 11940 
caaggaagaa aacccaggcc ttagcttctg gcagacttgt gtttctcctt tacttactag 12000 
ctgcatgact catgagcaag gaaatcaaac tttatgtgcc tgagtttcct catctataaa 12060 
atggagacta taataatcat ctcctaggct tgttttgagg atgttcaaca aatgctcctt 12120 
tcattcctct atttacagac ctgccgcaga caattctgct agcagccttt gtgctattat 12180 
ctgttttcta aacttagtaa ttgagtgtga tctggagact aactctgaaa taaataagct 12240 
gattatttat ttattttctc aaaacaacag aatacgattt agcaaattac ttcttaagat 12300 
attattttac atttctatat tctcctaccc tgagttgatg tgtgagcaat atgtcacttt 12360 
cataaagcca ggtatacatt atggacaggt aagtaaaaaa catattattt attctacgtt 12420 
tttgtccaaa aattttaaat ttcaactgtt gcgcgtgtgt tggtaatgta aaacaaactc 12480 
agtacagtag tattcagtac agtatttaag cccctgtact taaacatatt cctcgtacca 12540 
atgaagttac atgaaaagca aatttgtgtg agatatcgta gatggaagta aattagtctt 12600 
tatgttcccc acaaattgaa atgcatttca aaaactctgt gtgtgtatgt gtgtgtgtga 12660 
cagagtgtgt gtgagagaga gacagagaga tacgctttgg ttgcctccat aagctggctg 12720 
ctatgattaa taagaccaag ttttctaaag aaaatgagat cataacaaaa gccctcttta 12780 
tgactatctt ttatcagggg caaaaaggaa agagacaaaa cagcatgaaa tgatgagacc 12840 
aagtgatgaa aattcattca caatgattgc tttcaagagt aatttctctt gggtaattca 12900 
gcagcctgtt actatggctc tctggagtga tagctaatgt aaatgaagcc tctaaaagtg 12960 
gattatcctg acaagaatat actcagccaa taatgcaaca gaaatccatt caaagcattc 13020 
gggaaaaatt caaaagaata aatattcttt tttttttttt aaagttaatg acctacgatc 13080 
catttcttcc ctgactaaca agcagcaagc acttaaaaat atccagccag gatgaaatag 13140 
aaacccacct gacttgttaa tatttttgtt tggtcccagg gactcagatt ctaagccaaa 13200 
ttctttgaat gatcttggca aatgtctcga attatttttg ccaacttttc tttatcttgg 13260 
aaaaaaagtt tcatgaatgg gtgtcaaaat tgattagttt taaaaacctt tcttgcagat 13320 
acgtatggca ccctaaaact gtattagaaa aaaagtaagt actctgtagt gtgaaaaatt 13380 
cttaaaggac accctctttt acaaactcac aaaaacagcc tttggaatac ccacatgaag 13440 
tagctgttgt tattgctttc tatataccta catcttgtct attataaaaa gactggtttt 13500 
tggcaggtgt ggtggctcac acctgtaatt ccagcacttt gggaggccaa ggcgggcgga 13560 
tcacctgaga tcaggagttc aggaccagcc tgatcaatat ggtgaaaccc agtctttact 13620 
gaaaatacaa aaatcacccg ggtgtggtga cgggcgcctg tagtcccagc tactcgggta 13680 
gctgaggcag gagaatcact tgaactcagg agtcagaggt tgcagtgagc tgagatcatg 13740 
ccactgcact ccagcctggg tgacagagca agactccatc tcaaaaaaaa aaaaaaaaaa 13800 
gactggtttt tcaacagcta ttcccacccc tctgcatgga aatattcacc cagtcaattg 13860 
ttttcctagt ttgggtaatg gccctctggg caggactgga gtggggcaca caggagaagc 13920 
tgcaaactat gtttagaagc atgtctggga aatgtcatgc aagaaaagac atatttaaag 13980 
gtaggctttg catgaatgga aaaggagagt aattctatgt agagcagagc ctcttacttg 14040 
cagtgagaga agcaaaagtg gggaagcaag aggaattatg cttttcatca gccaaatttg 14100 
caggtaggag gattggctca gtcatcttgg ctgaggctca tgaaaccagg tgtaaagaaa 14160 
gtggactaga ttaatttcat ccattacagg aagaggagcc gtgaaagata atccagaaat 14220 
cattgggatt tgatggtaga aggtattttg ggactattcc atttgaaatg agaaggtacc 14280 
tgacattctt tgaattcctt tcaagcaaag gattaaattt acccatgagt tgactcagaa 14340 
aaaacataaa aagtattgtt gctctgctca gagttttatc taactcattc tcacttctta 14400 
ttccatgatg aaatgacata aatgaggttt tttattgttg ttgttgttgt tttctggaca 14460 
caaggcaagg tagctacctg ggcagagctg ttttatttct ctatgccgtg gagagaaatt 14520 
ggttaattgg ccatggaagg cagtcattaa gatgttccca tgcgagtgaa ctttccaggg 14580 
ttcccagctt ctgcatcctt ccctgtccct caattccatt gttggtgatg acaatgtctc 14640 
tcccatcagc ctcatgaagt tctctctcat ttattaaaat ttgctttcag gaaaaatttt 14700 
gaaaatgtgt ccagtaatgc ctgattggcc ccttatccta aaggcttaaa ctggaggaag 14760 
gaagctaaac tgagaaatct tgcaaatcat tgagccaaaa acgtattaat agcaagatct 14820 
atcatttatt gactagtatg tggcaggcag tgccctttta tttaggcagg gagagttgat 14880 
ggggggggcg gggttcacac atcttaaaga ggtgctatct cctcctatat aaatcatgta 14940 
agtcaagaga gtaaggaatt gtctttgttt ggttatattc aggggattag agtatacagt 15000 
agaagatccc aagaaacctt gggatcattt tagactaaga aatgccaata ccgccgggcg 15060 
cggtggctca cgcctgtaat cccagcactt tgagaggccg aggtgggcgg atcacaaggt 15120 
caggagattg agaccgtcct ggctaacgtg gtgaaaccct gtctctacta aaaatacaaa 15180 
aaattagccg ggcgtggtgg cgggcgcctg tagtcccagc tactcgggag gcggaggcag 15240 
gagaatggtg tgaactcagg aggcggagct tgcagtcagc cgagattgcc ccaatgcact 15300 
ccagcctggg cgacagaacg agactccgtc tcagaacaaa acaaaaggaa atgccaatac 15360 
cagcagaaat agagccaaat catgaacata agctaaacaa atgttggcag tgtagcctag 15420 
tggttaagag agcagactct taactagaac actgcactcc atgtcctcac tgtagaccct 15480 
cactgtgggg ttctaattaa cccctgttac ttaccagtgg cagtcttaag gcattcctta 15540 
agttcgttgt gccccaattt gttcatctgt agaaggggta ggatgacagt agtgtttact 15600 
ttataggctt actgtgagca ttaaatgagt tactactgta tttgtaaagt gcttaaaatg 15660 
ctgctccaaa agagtttgtt aaacacttaa gaactgattt acttgcatct aaactgacag 15720 
ctctcaataa ctggaaatga tcaagcatag gccctggaat ataagcaggt ctacatgaag 15780 
gcaaaaatgt tcgtttcttt tgttcagccc tgtgcctaga tcaatatcta gtgatcatgc 15840 
tcaagaaata ttgttgaatg aatcaatgaa cctaccgagg tagttacata aaagagttct 15900 
gcatgagtac aaatctgggc aaagtgacct ccaaggaaat ttccactttt agattctgtg 15960 
atttccttaa ggaactgata aattggtgtg atacaatgta aaaaaatgtg cctatatgat 16020 
ttgagaaaaa cttattttct ctccctcttt tttccttcct tccttcctcc ctcccttcct 16080 
tccttcctcc ctcccttcct tcctccctcc ctcccttcct tccttctctt tcttcttttc 16140 
tttctttctt tctttctttc tttctttctt tctttctttc tttctttctt ctcttctctt 16200 
tcctttcttt cttcctttct ttgtgccttt ctttctttct ttctttcttc tctgtccttt 16260 
ctttcttcct ttctttcttt ctgcctttct ttctctttgt ttttatttct ttccttcttt 16320 
tttcctttaa gcagaccatg tctgttagat gaatgccttt ttctagttaa aaggttaaac 16380 



WO 2004/046376    PCT/DK2003/000794 



aggaaagtga agcacaatta tcaagggtct ccagtcatct ccacatgttc ttaatcatta 16440 
tcttctttta cagtttcata tctccaggcc tttcattggg tcaggttggc atttcgctgc 16500 
cctttatgtg tgtgacaagt gaaaataagg aaagaaaaaa actcaagtga agaaaatcag 16560 
aatctgcgca gcagttcctg ggcgtttcag ctgcttccca catcacctgc ctcatcaagc 16620 
cccagcatcc atctccttgc tcatcttaca ccctgtgtgc atgacaggcc caccattcat 16680 
ttatcagagc aaaggctctc ccactattct ggttcacccc cctacttagc cagatataca 16740 
agaatatctg cacggatgac ctgcctcacc tgggagctca gaggagctca gattccatta 16800 
ctatcgcacc aaggacagat ctcccagcaa gaatgacaga aaagactaac tgcccccaaa 16860 
atctcccttc caaaacacag ttctcttaat tctcccaaga aaccagaatg tgactgctca 16920 
cctctctaag gacctgaaaa caactggcca tttcagctat ttaaatcaac tttaaaaaat 16980 
ccaaccgcca aaatattaaa ccattttggt tggaatgata acataactaa cctgctgaca 17040 
gctgcttctg ctaggtgcaa aaatggaaaa aaaaatactt ctaatcaggt caaatcactc 17100 
tacctttggg attctaaatt tactcatatt ctcaaagaaa tatattcagt catagtgggg 17160 
aaaataggat tattccttta gctcgataag caaccagaag ttcttccttc aaatcttgac 17220 
atttaatcaa tcagaaattg atttttggaa aactgtttcc tatgaagcta tctctgcctg 17280 
aaggattttt cttttacaat ccagactata gaaggaaatt cacaacctgg actttcacct 17340 
ccattggtca gagttttact gaccaattcc cacctctgcc ttacacctaa cggaagttta 17400 
tgcctgtttt ctcttcacat accccaacag ttacaaatgg ttgttattat taagcatctt 17460 
ttattttgtg gcctctgatt acatggtccc ctaaattttg acctaatcac aaaagattgg 17520 
taaaatttct taacatatta ataatatttt gtttatgtgt caatatctta gcatgtatca 17580 
attaagacag aggtcttaac gttctctttt tgaaagagaa tattaggatt cagagatatt 17640 
aagagattct cccaggatca cagttaggta acagagctgg attttagtcc aggtctgtct 17700 
acagctctaa cgtatataca ccctttgtat aacatgtcac gaattcagca taaagggatc 17760 
ttcagtgatc taagtcaggg gtcagcaacc ttttctaaaa aggaccaaat agtaatattt 17820 
caggctttgt ggaccctatg gtctctatca taactgttca aatcaccatg tagtgtaaaa 17880 
ggagccataa gcaaaatata aactaacgaa tgtggctgtt ttatgggatt tttttttaac 17940 
tctttattta caaaagcagg tggcagatca gaactcactt atgggccata gttctctgac 18000 
ccctgacctg agaaaatctt atatttatgg acaacattta gactgtgact tgccaagtaa 18060 
gaacaagaag ctctgtcaac tgaaggtcaa ggctggagtt ctgaaagcaa agagctgtct 18120 
ggtgttaatg ataagtgaaa tagttaaagt tagaagatcc cagttataag aagcacaaag 18180 
aataatgacc atagactcct gaacaagaat gtctggactt ctggcttagg cactcttgtt 18240 
gtatggtcca ggccaagtta cctaatctct ccaggcctcc attttcttat cattaaatga 18300 
agataataaa agtattttcc tcagagagct gtaagaataa actgagctaa cccatgtcaa 18360 
gcacatagaa tagggcccag cctatattaa tttatcaata aatgccagct acatattagt 18420 
tctctatatt tttattcatt atcataaaat gtttatctac agattggcat tgtaaggatg 18480 
gagttaaaat tgtatgtatg tgaagggaaa ttattcctgt tactattgat ctgcatcaca 18540 
ttaccccaaa tttgatggct taaagtaaca acattcattt tgcaaacaaa tttgaaattt 18600 
gaggagggct tgtctgggaa gacttgtctc tgccctatgt ggtatcagca gggggaggct 18660 
tgacggactg gcacatgccc ttccagaatg gcccactcgc atgcctgcca agttggtgct 18720 
ggctcttggc tgggagctca gctggggctg agtgctaggg tccctgggag gttccttgtg 18780 
gcctgaactt cctcaccaca aggcggctgc ggtgcgagag tgagcatttc aagatagagc 18840 
caagatgaca ctgtattact gtgtaagacc cagcctggga attaatgtag cctcacttcc 18900 
atcccactct atttttaaaa agtgaattat taaggtcacc ccatattcaa ggggatagga 18960 
attagacttc atctgtatta agaaaaatgt ttttaaaaat tgtagacatg ttttaaaatt 19020 
ctaaagtcca cttactggct gcagattatt tatatataca tgcaagatac actcctacat 19080 
tctcttctta gaaggctcag ttgcaggtac agatgaagct cttcaagtga gatttcttat 19140 
gtatttatcc tctcaatctg aagacttgta aactaagaga caagttattt gcaacctaca 19200 
tacgcaatat tcaatggtaa agtatacata ggacagccac tacagacact cttgttttaa 19260 
atagaggaaa atgagagcac ataacagtca ttggctcata gcaactctga tatccagaca 19320 
gcaaacacaa gcaggtcttt ttttaggtct cagtcctact gcctggattc cctactgctc 19380 
ttgggtcttc cctccaggtt cttggttctt ggacctcttt tcatttaata ctatttctgt 19440 
tcctttaagt tcaagctggc aaaatatgat tgtacaattc tgtttaaaat tccaggactt 19500 
cctgtgattc ttattgggga atactccatt agacaagaat ctctttgaca taagccattc 19560 
tctacctgag atccctgtaa ggctgtgatg ggaccacata accttaaaat tattagaaga 19620 
ctcattgttt actgagagaa tatgcctagc atatgcttag atccttagag gaactctgtt 19680 
tcaaagggct tatgagacat taccttatat ctttctaagg tacaaacaaa aggtctttgg 19740 
cttttgagtt tgatctttga gctgacacct tttcttaatt tgagaatccc ctgctctatg 19800 
gagagactga caaagagaaa tagttttata tttgaatgta acatcttgga tctttaatag 19860 
attatcttaa aattttcctg aaaatgtaac agttcctttt tttaaaattc attctcccta 19920 
cacacttatt atatatgact aaaagaaact ccctggcatt ttcaacattc tggttagaat 19980 
<210> 2 
<211> 3893 

<212> RNA 

<213> Homo sapiens 

<220> 

<221> mRNA 

<222> (0)...(0) 

<223> short version of mRNA 

<400> 2 

atcagaggaa ggaaataaag gagggtgaga gtaaattctc ttttagcatt cagattccac 60 

agattccaca aatcacattt ctttttttac caactaagga aaaataacac ttgacctaac 120 

atttcattgc agttagctaa aggatgctag aaaaactatg ttgcagtggt ttgctctaat 180 

ttcttcagga atagagaaaa gtgacaaaaa gatcagagaa gagaagaaag gaaactatca 240 

gaaaaataca gaattggagt aggatataac atatttgggt tgaaggtaaa attttatatt 300 

gtaatcttaa gtatcttgct acttcagttt ggtccctgga acagcagcat cagaatctgc 360 

cgagggcttg ttaaaaaggc agaatctcag gtcccatccc agactcactg aatcagaata 420 

taaatactga caagatgccc cgggattcat atgcacagta gagctggcga agttccattg 480 

tagcctgtga ttgttttctg caacttagta tttctgagtt ttcccaagga agaaaaccca 540 

ggccttagct tctggcagac ttgtgtttct cctttactta ctagctgcat gactcatgag 600 

caaggaaatc aaactttatg tgcctgagtt tcctcatcta taaaatggag actataataa 660 

tcatctccta ggcttgtttt gaggatgttc aacaaatgct cctttcattc ctctatttac 720 

agacctgccg cagacaattc tgctagcagc ctttgtgcta ttatctgttt tctaaactta 780 

gtaattgagt gtgatctgga gactaactct gaaataaata agctgattat ttatttattt 840 

tctcaaaaca acagaatacg atttagcaaa ttacttctta agatattatt ttacatttct 900 

atattctcct accctgagtt gatgtgtgag caatatgtca ctttcataaa gccaggtata 960 

cattatggac aggtaagtaa aaaacatatt atttattcta cgtttttgtc caaaaatttt 1020 

aaatttcaac tgttgcgcgt gtgttggtaa tgtaaaacaa actcagtaca gtagtattca 1080 

gtacagtatt taagcccctg tacttaaaca tattcctcgt accaatgaag ttacatgaaa 1140 

agcaaatttg tgtgagatat cgtagatgga agtaaattag tctttatgtt ccccacaaat 1200 

tgaaatgcat ttcaaaaact ctgtgtgtgt atgtgtgtgt gtgacagagt gtgtgtgaga 1260 

gagagacaga gagatacgct ttggttgcct ccataagctg gctgctatga ttaataagac 1320 

caagttttct aaagaaaatg agatcataac aaaagccctc tttatgacta tcttttatca 1380 

ggggcaaaaa ggaaagagac aaaacagcat gaaatgatga gaccaagtga tgaaaattca 1440 

ttcacaatga ttgctttcaa gagtaatttc tcttgggtaa ttcagcagcc tgttactatg 1500 

gctctctgga gtgatagcta atgtaaatga agcctctaaa agtggattat cctgacaaga 1560 

atatactcag ccaataatgc aacagaaatc cattcaaagc attcgggaaa aattcaaaag 1620 

aataaatatt cttttttttt ttttaaagtt aatgacctac gatccatttc ttccctgact 1680 

aacaagcagc aagcacttaa aaatatccag ccaggatgaa atagaaaccc acctgacttg 1740 

ttaatatttt tgtttggtcc cagggactca gattctaagc caaattcttt gaatgatctt 1800 

ggcaaatgtc tcgaattatt tttgccaact tttctttatc ttggaaaaaa agtttcatga 1860 

atgggtgtca aaattgatta gttttaaaaa cctttcttgc agatacgtat ggcaccctaa 1920 

aactgtatta gaaaaaaatt tcatatctcc aggcctttca ttgggtcagg ttggcatttc 1980 

gctgcccttt atgtgtgtga caagtgaaaa taaggaaaga aaaaaactca agtgaagaaa 2040 

atcagaatct gcgcagcagt tcctgggcgt ttcagctgct tcccacatca cctgcctcat 2100 

caagccccag catccatctc cttgctcatc ttacaccctg tgtgcatgac aggcccacca 2160 

ttcatttatc agagcaaagg ctctcccact attctggttc acccccctac ttagccagat 2220 

atacaagaat atctgcacgg atgacctgcc tcacctggga gctcagagga gctcagattc 2280 

cattactatc gcaccaagga cagatctccc agcaagaatg acagaaaaga ctaactgccc 2340 

ccaaaatctc ccttccaaaa cacagttctc ttaattctcc caagaaacca gaatgtgact 2400 

gctcacctct ctaaggacct gaaaacaact ggccatttca gctatttaaa tcaactttaa 2460 

aaaatccaac cgccaaaata ttaaaccatt ttggttggaa tgataacata actaacctgc 2520 

tgacagctgc ttctgctagg tgcaaaaatg gaaaaaaaaa tacttctaat caggtcaaat 2580 

cactctacct ttgggattct aaatttactc atattctcaa agaaatatat tcagtcatag 2640 

tggggaaaat aggattattc ctttagctcg ataagcaacc agaagttctt ccttcaaatc 2700 

ttgacattta atcaatcaga aattgatttt tggaaaactg tttcctatga agctatctct 2760 
gcctgaagga tttttctttt acaatccaga ctatagaagg aaattcacaa cctggacttt 2820 
cacctccatt ggtcagagtt ttactgacca attcccacct ctgccttaca cctaacggaa 2880 
gtttatgcct gttttctctt cacatacccc aacagttaca aatggttgtt attattaagc 2940 
atcttttatt ttgtggcctc tgattacatg gtcccctaaa ttttgaccta atcacaaaag 3000 
attggtaaaa tttcttaaca tattaataat attttgttta tgtgtcaata tcttagcatg 3060 
tatcaattaa gacagaggtc ttaacgttct ctttttgaaa gagaatatta ggattcagag 3120 
atattaagag attctcccag gatcacagtt aggtaacaga gctggatttt agtccaggtc 3180 
tgtctacagc tctaacgtat atacaccctt tgtataacat gtcacgaatt cagcataaag 3240 
ggatcttcag tgatctaagt caggggtcag caaccttttc taaaaaggac caaatagtaa 3300 
tatttcaggc tttgtggacc ctatggtctc tatcataact gttcaaatca ccatgtagtg 3360 
taaaaggagc cataagcaaa atataaacta acgaatgtgg ctgttttatg ggattttttt 3420 
ttaactcttt atttacaaaa gcaggtggca gatcagaact cacttatggg ccatagttct 3480 
ctgacccctg acctgagaaa atcttatatt tatggacaac atttagactg tgacttgcca 3540 
agtaagaaca agaagctctg tcaactgaag gtcaaggctg gagttctgaa agcaaagagc 3600 
tgtctggtgt taatgataag tgaaatagtt aaagttagaa gatcccagtt ataagaagca 3660 
caaagaataa tgaccataga ctcctgaaca agaatgtctg gacttctggc ttaggcactc 3720 
ttgttgtatg gtccaggcca agttacctaa tctctccagg cctccatttt cttatcatta 3780 
aatgaagata ataaaagtat tttcctcaga gagctgtaag aataaactga gctaacccat 3840 
gtcaagcaca tagaataggg cccagcctat attaatttat caataaatgc cag 3893 

<210> 3 
<211> 121 
<212> PRT 
<213> peptide 

<220> 

<221> PEPTIDE 
<222> (0)...(0) 

<400> 3 



Met Phe Asn Lys Cys Ser Phe His Ser Ser Ile Tyr Arg Pro Ala Ala
1               5                   10                  15

Asp Asn Ser Ala Ser Ser Leu Cys Ala Ile Ile Cys Phe Leu Asn Leu
            20                  25                  30

Val Ile Glu Cys Asp Leu Glu Thr Asn Ser Glu Ile Asn Lys Leu Ile
        35                  40                  45

Ile Tyr Leu Phe Ser Gln Asn Asn Arg Ile Arg Phe Ser Lys Leu Leu
    50                  55                  60

Leu Lys Ile Leu Phe Tyr Ile Ser Ile Phe Ser Tyr Pro Glu Leu Met
65                  70                  75                  80

Cys Glu Gln Tyr Val Thr Phe Ile Lys Pro Gly Ile His Tyr Gly Gln
                85                  90                  95

Val Ser Lys Lys His Ile Ile Tyr Ser Thr Phe Leu Ser Lys Asn Phe
            100                 105                 110

Lys Phe Gln Leu Leu Arg Val Cys Trp
        115                 120



<210> 4 

<211> 6209 

<212> RNA 

<213> Homo sapiens 

<220> 

<221> mRNA 

<222> (0)...(0)

<223> long version of mRNA 

<400> 4 

ggatgtgagt gggccttcag acttaaacca ggagttacac ctttggcttc cctggttctc 60 
agttctttgg acttggactg aattacactg ccaggtttcc tggttctcca gcttgcagat 120
ggcagatcat gggacttctt ggcctccata attgtgtgag tcaatttcca ttttatttac 180
atatccagtt atgcattgct taacaatgga gacaggttct gagaaatgca ttgttaagtg 240
atttcatcat tgtgcaaaca tcatagagtg taactacaca aacctggaca gcatagacta 300
ctacacatct aggctacatg gtgtagcttg taacctcatg ataagtatgt ataacatcat 360
gataagtatg tatgtatcta ccatatctaa atgtagaaaa ggtacagtaa aaatatggta 420
taatcttatg ggatcaccat catatatgca atcctttgta gactgaaatg tcattgtgta 480
gtgcatgact gtatacgcac acatacacaa acacacacaa atatactatt ggttcttttt 540
ctctgaagag ccctaataca atatgttata catttatatt gactctattt caaaatttat 600
ggttttggtg aaacatatgt ggagatgggg cataggtgtg tgaactggga tagtgtcctg 660
ctgatgaatg ggtgggaggc atcatttggg acaagcccag ggcatcagct tatagatatc 720
aagagctcaa caagagcact ttatggcaaa acctcccaca agacctctca gaagttgaga 780
aactgctaaa agtttcttta tgacagatga catttatgga taaaataggg attagcagga 840
ttctttaaat actttcgaac actaaccttc atttctacca ggcagtgggg ccccaagtgc 900
agggccatag gaagtacaag tctgggagat actaggctgc actgtctgta gagaatctga 960
aaaaataata gagtcactga aatgcagttt ggtataatta ttgccatgca tcataattct 1020
aaatcatact agtggtcaaa tactcttccc tgaaaaaaca ttttcttggt ttgaattcta 1080
aataattgtt gtggtcacca ctgagctttt aaatatataa atactttcaa gtttgcatat 1140
ttttattacc tgttccttaa caaacattga attcaacatg aaaatgatta tgggaaacat 1200
tcgggtatac agtccctgac tcttaaggac tcaggtaaat acttagggta tttcatggcc 1260
ctagtctttg gggtaccaca tgtttcttct tcaaatcaca gattcaaaat caagaatgat 1320
aacacagtga ttgtgtagac aaaataagtg aaccaaaatt gcttgcttct gtcattctat 1380
ggaaccactg agagttttta cttgtgctta aaattttgaa tagtaaaaca gagtgtcaac 1440
ttcatgctgg aatatttttg gctttttaga cacaatttta agtacatgaa gtatttttac 1500
aagactaagt aacatcactg aaattacagc tttcttcttt ttaaaactgg tatttgttat 1560
aaaactaaag agcgaatcaa gaaaagcata attattactg attattacag gattattact 1620
gaaaaagaaa tgtacggaat agaggaggaa ggagttaaca aatgatccac tctgggtgtt 1680
gaaaacacca ataagcctgc ttccaggaag tgcctaagac agagctggct cagcttgctg 1740
ggtcacagca tgtaaggaaa ctgctgggct acatgccacc atcctcagtt gtccagatag 1800
ataatcccat agccccatgg ggaaataatc tttaattatg atatagctga caccattcaa 1860
agcactatgc taagtccttt atgtgaatta acttttgtca aatttatttt tcataaataa 1920
cccaaatatg tataccacta ttatcctacc ttaaagagga gaaactgagc tcctaaagtt 1980
taaatatcta acccaagtta agactgctag tcaccctagg ctattaactc aggcagtcta 2040
actcaggtat aataacatta tgctactgtt tgcagctttg actatgcctg aattataacg 2100
tcatgctatc taactaaaaa gctaagggaa ataaaatgag ccatagggct caatttcata 2160
aaaggagaga aaatactggg gaaaagtgat aatgcagagt ttaaaatatt tttgtaaaag 2220
tgccagagat tgagtataac aagtgtgacc aaaaaaaaaa aaaaaaaaaa aaaaggaaga 2280
aggtaaaaaa aagagggagg tctgagaaat agaaatatca gaggaaggaa ataaaggagg 2340
gtgagagtaa attctctttt agcattcaga ttccacagat tccacaaatc acatttcttt 2400
ttttaccaac taaggaaaaa taacacttga cctaacattt cattgcagtt agctaaagga 2460
tgctagaaaa actatgttgc agtggtttgc tctaatttct tcaggaatag agaaaagtga 2520
caaaaagatc agagaagaga agaaaggaaa ctatcagaaa aatacagaat tggagtagga 2580
tataacatat ttgggttgaa ggtaaaattt tatattgtaa tcttaagtat cttgctactt 2640
cagtttggtc cctggaacag cagcatcaga atctgccgag ggcttgttaa aaaggcagaa 2700
tctcaggtcc catcccagac tcactgaatc agaatataaa tactgacaag atgccccggg 2760
attcatatgc acagtagagc tggcgaagtt ccattgtagc ctgtgattgt tttctgcaac 2820
ttagtatttc tgagttttcc caaggaagaa aacccaggcc ttagcttctg gcagacttgt 2880
gtttctcctt tacttactag ctgcatgact catgagcaag gaaatcaaac tttatgtgcc 2940
tgagtttcct catctataaa atggagacta taataatcat ctcctaggct tgttttgagg 3000
atgttcaaca aatgctcctt tcattcctct atttacagac ctgccgcaga caattctgct 3060
agcagccttt gtgctattat ctgttttcta aacttagtaa ttgagtgtga tctggagact 3120
aactctgaaa taaataagct gattatttat ttattttctc aaaacaacag aatacgattt 3180
agcaaattac ttcttaagat attattttac atttctatat tctcctaccc tgagttgatg 3240
tgtgagcaat atgtcacttt cataaagcca ggtatacatt atggacaggt aagtaaaaaa 3300
catattattt attctacgtt tttgtccaaa aattttaaat ttcaactgtt gcgcgtgtgt 3360
tggtaatgta aaacaaactc agtacagtag tattcagtac agtatttaag cccctgtact 3420
taaacatatt cctcgtacca atgaagttac atgaaaagca aatttgtgtg agatatcgta 3480
gatggaagta aattagtctt tatgttcccc acaaattgaa atgcatttca aaaactctgt 3540
gtgtgtatgt gtgtgtgtga cagagtgtgt gtgagagaga gacagagaga tacgctttgg 3600
ttgcctccat aagctggctg ctatgattaa taagaccaag ttttctaaag aaaatgagat 3660
cataacaaaa gccctcttta tgactatctt ttatcagggg caaaaaggaa agagacaaaa 3720
cagcatgaaa tgatgagacc aagtgatgaa aattcattca caatgattgc tttcaagagt 3780
aatttctctt gggtaattca gcagcctgtt actatggctc tctggagtga tagctaatgt 3840
aaatgaagcc tctaaaagtg gattatcctg acaagaatat actcagccaa taatgcaaca 3900
gaaatccatt caaagcattc gggaaaaatt caaaagaata aatattcttt tttttttttt 3960
aaagttaatg acctacgatc catttcttcc ctgactaaca agcagcaagc acttaaaaat 4020
atccagccag gatgaaatag aaacccacct gacttgttaa tatttttgtt tggtcccagg 4080
gactcagatt ctaagccaaa ttctttgaat gatcttggca aatgtctcga attatttttg 4140
ccaacttttc tttatcttgg aaaaaaagtt tcatgaatgg gtgtcaaaat tgattagttt 4200
taaaaacctt tcttgcagat acgtatggca ccctaaaact gtattagaaa aaaatttcat 4260
atctccaggc ctttcattgg gtcaggttgg catttcgctg ccctttatgt gtgtgacaag 4320
tgaaaataag gaaagaaaaa aactcaagtg aagaaaatca gaatctgcgc agcagttcct 4380
gggcgtttca gctgcttccc acatcacctg cctcatcaag ccccagcatc catctccttg 4440
ctcatcttac accctgtgtg catgacaggc ccaccattca tttatcagag caaaggctct 4500
cccactattc tggttcaccc ccctacttag ccagatatac aagaatatct gcacggatga 4560
cctgcctcac ctgggagctc agaggagctc agattccatt actatcgcac caaggacaga 4620
tctcccagca agaatgacag aaaagactaa ctgcccccaa aatctccctt ccaaaacaca 4680
gttctcttaa ttctcccaag aaaccagaat gtgactgctc acctctctaa ggacctgaaa 4740
acaactggcc atttcagcta tttaaatcaa ctttaaaaaa tccaaccgcc aaaatattaa 4800
accattttgg ttggaatgat aacataacta acctgctgac agctgcttct gctaggtgca 4860
aaaatggaaa aaaaaatact tctaatcagg tcaaatcact ctacctttgg gattctaaat 4920
ttactcatat tctcaaagaa atatattcag tcatagtggg gaaaatagga ttattccttt 4980
agctcgataa gcaaccagaa gttcttcctt caaatcttga catttaatca atcagaaatt 5040
gatttttgga aaactgtttc ctatgaagct atctctgcct gaaggatttt tcttttacaa 5100
tccagactat agaaggaaat tcacaacctg gactttcacc tccattggtc agagttttac 5160
tgaccaattc ccacctctgc cttacaccta acggaagttt atgcctgttt tctcttcaca 5220
taccccaaca gttacaaatg gttgttatta ttaagcatct tttattttgt ggcctctgat 5280
tacatggtcc cctaaatttt gacctaatca caaaagattg gtaaaatttc ttaacatatt 5340
aataatattt tgtttatgtg tcaatatctt agcatgtatc aattaagaca gaggtcttaa 5400
cgttctcttt ttgaaagaga atattaggat tcagagatat taagagattc tcccaggatc 5460
acagttaggt aacagagctg gattttagtc caggtctgtc tacagctcta acgtatatac 5520
accctttgta taacatgtca cgaattcagc ataaagggat cttcagtgat ctaagtcagg 5580
ggtcagcaac cttttctaaa aaggaccaaa tagtaatatt tcaggctttg tggaccctat 5640
ggtctctatc ataactgttc aaatcaccat gtagtgtaaa aggagccata agcaaaatat 5700
aaactaacga atgtggctgt tttatgggat ttttttttaa ctctttattt acaaaagcag 5760
gtggcagatc agaactcact tatgggccat agttctctga cccctgacct gagaaaatct 5820
tatatttatg gacaacattt agactgtgac ttgccaagta agaacaagaa gctctgtcaa 5880
ctgaaggtca aggctggagt tctgaaagca aagagctgtc tggtgttaat gataagtgaa 5940
atagttaaag ttagaagatc ccagttataa gaagcacaaa gaataatgac catagactcc 6000
tgaacaagaa tgtctggact tctggcttag gcactcttgt tgtatggtcc aggccaagtt 6060
acctaatctc tccaggcctc cattttctta tcattaaatg aagataataa aagtattttc 6120
ctcagagagc tgtaagaata aactgagcta acccatgtca agcacataga atagggccca 6180
gcctatatta atttatcaat aaatgccag 6209



<210> 5 
<211> 80000 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> gene 

<222> (0)...(0)

<223> human genome sequence 

<400> 5 

cacacgtagg ctacgagtgg ccctcagcct 

atgtttaatt gtgctgtttt cttatagagg 

gacacccaga gagaatcaca ggttttcaga 

gtcatatctc tggatctctg gttctcacat 

gccatgtcat tcatatcctg tgtaatatgg 

aaagcacctg caatttttct ttgccaagca 



gcctcatcat ggacctgtgt tataataaat 60 
aaagtcctga tgttagttgc cttgaagtca 120 
ttaattcatc gcttgattct tatccctgaa 180 
tataaatttc aatgattctt tttctatatg 240 
ggaaactgag gtatgaatga catcattcaa 300 
cttacagctt tttctcatgt tgctttcaaa 360 
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aagtcattga aatattgttc acatattttg 
aggtatcttg tccaagttct tacagccaga 
ttctacctct tggcttatgc tcttttcatg 
acttcatgtt tcaactattt atcaaatact 
gtgggatccc ctgaagcctg ctgattattt 
ccttactctc ctatggtaaa gttttattgt 
gaaatcctac agatcatact caacaacatg 
atttatgaat actttatgtc aaacataagc 
aaattgacac tttttgacat aaccatattt 
tatttattta tttatttatt tatttttgag 
gtgcagtggc aacatcatag ctcactacag 
ccatctcagc ttcccaagta gctgggacta 
ttcgtatttt gtagagacgg ggttttgcct 
tgttccgccc accttggcct cccaaagtgc 
actttttagg gttgaaattt atattgactg 
tctgctaact aatttatatt accatccaac 
tcactttttt tgtgtgaaca ttgaatacac 
gtcttattct ttattagagt taataatatt 
acttgtttgt gatggctacc tctttttttc 
aattagcacg gctgggcaaa aacaatgaaa 
gggtaaatga atataattca aaataccata 
ctgttggaaa tgtaaaagtg acatatgttc 
gttgagggaa ctggacttat acacaaaata 
catgagctaa agtctttggc aaagacatta 
ttactgtggg ctggaatggt ctgggaacat 
aaggatgagg ttcaaagaga tgggaaaaaa 
aaaaacaagt tgccagaaag aacaaggaag 
ctagggaagg taatataact agagacggca 
actttagccc tcattggtac gtgcagaaat 
gtaaatatat gttgggaaaa aaataaccgg 
gagaaaccta ttccatgcta tgaaatattt 
tacttttcca agccacaaga ggaaacattt 
tttctccact acgtctactc ttaagctctc 
aaacctccta gatgatatac ttatggaaat 
ggaaagacat ttctagcagg agagcatgaa 
ggttctgact gtggtcactc taactataaa 
catttccttg gggtccctaa aaatttcaag 
tgatttctca acacattcag atatcttatg 
tcagaaactc atatattaca gaataaaaaa 
aatgagagca gggttctatt tcattttgga 
tgataaaatg ttaacattag ttaattatta 
tatactttta tatgtatcac ctacatttct 
tctacctcgt atgattgctg gatgatttaa 
aaagcagaaa cttttagtta ttaagattct 
cctgctaaga aaccccaata atattctgat 
taatctaaca tcttcatatt aattttttta 
gtgtttctaa acgctggatg catttttaac 
ttttgtttta actatgattc agcttataac 
ttaataaggt taatgaaaag ttgacttatg 
tgttttcctt gctaaggata ttagtacttg 
tgaaactttg atgcaaatat cttatcaata 
tttcttttct tctttctttt ttgagacagg 
acccctggcc cctggcctca agtgatcctc 
acagatgtga gtcactgaat ccagcctcac 
ctggttctaa aactttttga taatactctc 
tggattacgt agttatgaat ttcataaatg 
ttatacagga tgtataacat aaccatggtt 
aaagactggt caggaccagg gcacgcatgc 
gccaggcctc ctgtgaaaaa aatcagtaag 
ttcctgaagt cactaccaca ctttgcactt 
gaattctggt gagaacgaca cgtctgaaat 



cagatgagga aatgaatatt caaatgcatt 420 
aagtagagaa atgaatttga attacaaatc 480 
acactgggaa taaatgtctg aacaagcatg 540 
tgttttctac taagatcttg cactcactca 600 
gtcctttggc atttatcact ctctgtggga 660 
tattaaaagt attatttgac aataaatgta 720 
tctaatgtca gcacacaatg tctaacaatc 780 
aataacctaa ttaaggaagg tatttttaat 840 
caagtggctc cattgttttg tttatttatt 900 
aaagggtctc actctgttgc ccagactgga 960 
cctcgacctc tctgggctca agcaatcctc 1020 
caggtgtgta ccatcatgcc aggctaattt 1080 
ggtcgtccgg gttggtctca aactcctggg 1140 
tgggattata ggcatgagcc tcaagtggct 1200 
tcaactagct tccctagtta gtatttggga 1260 
ttgtcaacat ttgttgaaat ataactgtcc 1320 
tttcagacta aatttggttt attacttaat 1380 
tcttaatact ttgccttcca caaatgaata 1440 
tcttagcctg tcacaggtat tatggataaa 1500 
gaaatacact tgcctgggaa agctggggag 1560 
tatttattca acactgttgg aatatatgtc 1620 
tcttcctggg tctcagactt ttaggatcta 168 0 
caattcaaca acattatgag ctagaaaatc 1740 
ggtaacatga ggagtcagga aaaggagaaa 1800 
gagatggagg aagtggcttg ttactggaga 1860 
agaaagagag aagaaagaaa agaaatgagg 1920 
aatagaggca ggtaagcagt ggattttgcc 1980 
gtttctaaca ggccatgatg aataagatac 2040 
tcaaatttgg aaattcaagc ttacatgaca 2100 
taaacattta catcagctct ttttcctaaa 2160 
gtcacaattc tgttttcaaa atacttgctc 2220 
tctctgccaa cactctctga ccttaaccag 2280 
tttagagctg tgtgtatctc gtctttatgt 2340 
attcaggcaa ctttttcatg aactttacca 2400 
tagaaatgga ctcttcccca gtctctgctg 2460 
aagtgtgtaa aaatcatgag cagattattt 2520 
gtatctgtat tagcacagga agatttaaat 2580 
aactttatta agataaattt cctccagcat 2640 
taaagcagaa aattagtgta cctggctaaa 2700 
aagtcactaa gacagtaata ataccattaa 2760 
gatgtgtttt tgtatgccag ccacataata 2820 
agatgtaaat gtgagggaat tatagtagta 2880 
atgagctgtt gtctcaaaaa cttggtatag 2940 
tactattcca atatttgaat aaaacagtga 3000 
acatcaaaac cttctggcat tagatgtttc 3060 
tgttttgatt atctacattc agtagtgaat 3120 
taaatgtgtt ttgtaccaca ttttgacaac 3180 
aaaacaaaac aatgcatctt ctctccactg 3240 
aaaaaaatcc taatttatgc acattctcat 3300 
acgattctgt aacaaagaat tatcatggga 3360 
caatgtgctt gattttacct agatgagatt 3420 
gttttgctgt gttacccagg ctagcctcaa 3480 
tcacctctgc ctcccaaagt gctgggtatt 354 0 
ttagttggct ttcttagtga attattttat 3600 
aaatatttat ggattttata acataattta 3660 
attttgtgat attgccacag atcatcacca 3720 
taatatattt tcataaacta tagaccaaac 3780 
attttatatg tgtggtgcct attggaatat 3840 
tgcttatctc ataggaccaa cggcccaaca 3900 
atctccatgt ggaaatagat agccactgtt 3960 
ctctcagctt cacaacccct attacagccc 4 020 
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tcagagaatc ttctcacata gcgccaaaca 
aatcaatttc taaaattaaa agtgaaaaca 
aaggctgatg ttactaaaag acaaaagaca 
ctcagggaaa tgcaggaaga aataaaagat 
tttatgtgtg gacccgaccg atctatggga 
gggtgcgtca gtgagggtgt ttttggaaga 
agcagatggt cctcaccaat gtgggtgcac 
caaaaaaggc aaaagaaggg tgaattccct 
tcctaccctc agacatgaga ggtttggatt 
ggtggccacc tctggtttca ggtctttggc 
ttggttcttg ggccttgaga ctcaagctaa 
tgtagggaca gcaaatcatg aaaccttctg 
ataaatctgt gcttatatat ctatagcttt 
atgtacacac tatatgacct aacctgtagt 
tgatggtatt ctcagtatct gggaggtatg 
taggactgag gagatttctg aaatgtggaa 
gccatgtgga aaacatgtag aactatcgta 
cccaaaggaa aggaaaccag tatattaaag 
ttattctcaa tagcctagat atggactcaa 
atgtggcata tgtacaccat gaaatactta 
tggtgtgtgc ctgtcatccc agctccttgg 
aagtttgaga ccagcctggg caaaataata 
ggccttcctt aaaaaaagaa taaaatcatg 
gatattactg ttaagtgaaa ttagccagga 
catatgcgga agctaaaaaa acgttgatct 
tggagactag aaaaggtagg gagaaggaag 
aaacaaaatt acagttagtt agggagaatt 
actatagtta ataataatac tttaattagt 
ttcccaacac acacaaaaaa atgataatgt 
tctgatcact ctacattata tgtatcaaca 
tttttatttg tccatttaaa aaagataaca 
tgttccatgt aaacctggat gaactggtca 
aaagtttccc aggctccaca tcaagttgtt 
ttgggcaaat agtaatttac agcaatagtg 
tctttggcat ctgcttcttc ttttctgtaa 
ttcttcccta gagctattca gggtgacctt 
tatatcaaaa taaaacagtc ctgtgtggcc 
aagataggat cggctgctct ttcttctcct 
gtcacccaga ctggagtgca atggcacagt 
cctcctcagc ctcctgagta gctgggacta 
gatatatata tatatatata ttatatatat 
atattatata tataatatat atattttata 
atatataaaa tatatatatt atatatatat 
gtcacccagg ctgaagatca gctgctcttt 
caggggataa aatactacct ggaataaagg 
atttttatag tatgacagag gcagagaaaa 
ccactcccag tacaatcatc tgtctgttac 
ggggtacaaa gtagatgcca aacttcacaa 
atcaaacctt tgtgaaagtt tccagctttt 
gaatctcaag agcataattg ccaagatagt 
tgaaatctgt tcccatcatt actcaagata 
tgctcaataa tgattgattg gccaatggag 
tgcaaggggc ccacaggaat cagtatgatc 
aagaaacaga tagtggcttg tattaggttt 
aaaagaaatt tgatcagagc ataaatctga 
aggaagaaaa cacagattat agacaggaaa 
atactaaaaa tgtacattca tttgaacaaa 
ttaccagcac taccatgccc aaactacctt 
gcacctgaaa tttaaaacaa aaaccaattt 
gcctgctaac aatgtcaaag tctcgtttac 
caaacgagtt caaaataaaa agcaaaccaa 



acaactttag gaagtgatgt tcctagaatg 4 080 
atgacaagga gaagggaggg tcagagagga 4140 
gtataacctc ttatgaggat ggtccagaca 4200 
aggagtttga accacactgt gatggctaac 4260 
cacccagata gcttgtaaag cactatttct 4320 
gatcaacact tgagtcagta gactgagtaa 4380 
attgtttaat ctgttgagtg cctggataga 4440 
ttctcgtctt aagctgggac agccatcctt 4500 
cttggatctt tggtcccaag ggctgacact 4560 
cccagattgt aagttacacc atcagcttcc 4 620 
aatacactac cagcttccct tgttctctag 4680 
cctccataat catataagtc aattcccgta 4740 
ccttttggtc tgtttctcca aagaacctta 4800 
aatgataacc ttatgcaggt ttgaataaga 4860 
ggctagagtg atgaaccacc gccatgagcc 4 920 
tatttggtgt caaaaccaag agataatata 4 980 
tgattcagca acccaaccac tgggaattta 5040 
agaatctgca ctcccatggt tattgcagca 5100 
cctaggagat tagatgaatg gacaaagaaa 5160 
ccagctataa aaaagaatta gccaaagcag 5220 
gaggctgagg tgggaagatc tcgaggccag 5280 
agactcggtc tctaaaacaa tttaaaaata 5340 
tcattcacgg caacatagat gggactggag 5400 
acagcaagtt aaaccccaca tattctgatt 54 60 
catagaagta aaaagtagaa cagaggatgc 5520 
ggagagggaa aaatttgtta acaggtacaa 5580 
aattccagca tcctgtagca ctataggatg 5640 
ctcaaatagc tagaaggagg atattgaatg 5700 
atgagatgat ggatatggta gttatcctga 5760 
catcactatg taccccacaa atatgtagaa 5820 
aatttaaaaa taaaataaaa actaaattag 5880 
ccctacgtct gcccatctag atggctggtc 5940 
ccactgctca ctggaacttc cctagtcagg 6000 
aatttatcac tgacatttct tcagttcccc 6060 
tgctgtttgt tgaaatgccc aacattcttt 6120 
tcttttcgca ttttcccatg ccacttccat 6180 
actgctcatg accttgtttc ctgccatgtg 6240 
cctttttttt cagagacagg atctctccct 6300 
cgtagcttgc tgcagcctcg aactcctgga 6360 
caggtgcaca ccaccatgcc ttcctaatct 6420 
aaaatatata tataaatata tatattttat 6480 
tataaaatat atatattata tatatatatt 6540 
atatatatta tagagatggg gtcttgctct 6600 
ctaatctgtg gttagataag atctgtctcc 6660 
tatctttaaa ataatcccag agaagaaaac 6720 
cagagaatat ttgttaaggc aggactttca 6780 
ctgcatacct tacacgggct ggcactgctg 6840 
tggttagatt catgtttaaa aagccattgg 6900 
ttctgttcca aatatgtgtc cattataaaa 6960 
ctatgtccat gagtatttca acatctctca 7020 
ttgtatgaac agtattccac ataaactagg 7080 
ggtcattatt taatgcacta caatctttta 7140 
ccataggaat ccttttcttt tccattgaaa 7200 
cttgtgtgtg ttgtgaggtg gaaagatatg 7260 
gcccatggga taggaaagaa tgagggaata 7320 
atcaaaccta ttaaaactga taattttcga 7380 
aagattctat aaagcaagat ttctctgttc 7440 
aggaaatgaa tagcagagtc aaacttaaaa 7500 
acattttatt taagaaaagc aaacagatgg 7560 
aaagaaaaaa acaaatctgg aacctgaagt 7620 
taaacagaaa ccaacataaa cagaagttac 7680 
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taccatctcc 
tctcacctgg 
cacattcata 
ctttggatct 
tcatatctat 
ttggcaattt 
tcttaaacat 
aagagaattt 
cctaaaggtc 
tcattgtgtt 
gtggctttcc 
gtttccaaat 
tctcctgatc 
tcatcattgc 
acatatgcag 
gttagcaagg 
tctctcattt 
gctcaggcca 
ccaagtagac 
actaattggt 
aggatgacca 
agagatcagc 
ggcatcatcc 
cttttcttct 
acttaaacca 
aattacactg 
ggcctccata 
taacaatgga 
tcatagagtg 
gtgtagcttg 
ccatatctaa 
catatatgca 
acatacacaa 
atatgttata 
ggagatgggg 
atcatttggg 
ttatggcaaa 
tgacagatga 
actaaccttc 
tctgggagat 
aatgcagttt 
tactcttccc 
ctgagctttt 
caaacattga 
tcttaaggac 
tgtttcttct 
aaaataagtg 
cttgtgctta 
gctttttaga 
aaattacagc 
gaaaagcata 
agaggaggaa 
ttccaggaag 
ctgctgggct 
ggaaataatc 
atgtgaatta 
ttatcctacc 
agactgctag 
tgctactgtt 
gctaagggaa 
gaaaagtgat 



ctcagcctgt 
aagacgagca 
aaccaccgac 
tcctcatggg 
tagtttgaac 
caaatgatta 
atgctgttac 
cttttggtca 
tgggtctggg 
ctgagctgtc 
atcgttggca 
tggcattatt 
tctctcacct 
ccaagtgtgt 
agatggctct 
ccactgagaa 
cctggtttct 
aactagggtg 
agaccatcta 
aagactatat 
ggtagctggt 
atttgaattg 
aatctgtcga 
tgatctagta 
ggagttacac 
ccaggtttcc 
attgtgtgag 
gacaggttct 
taactacaca 
taacctcatg 
atgtagaaaa 
atcctttgta 
acacacacaa 
catttatatt 
cataggtgtg 
acaagcccag 
acctcccaca 
catttatgga 
atttctacca 
actaggctgc 
ggtataatta 
tgaaaaaaca 
aaatatataa 
attcaacatg 
tcaggtaaat 
tcaaatcaca 
aaccaaaatt 
aaattttgaa 
cacaatttta 
tttcttcttt 
attattactg 
ggagttaaca 
tgcctaagac 
acatgccacc 
tttaattatg 
acttttgtca 
ttaaagagga 
tcaccctagg 
tgcagctttg 
ataaaatgag 
aatgcagagt 



gaaattctgg 
gagtgaacac 
tccaggagaa 
cttcaaggca 
catgcaaaat 
accctaatac 
atacaaaatc 
ctactccttc 
tcatttctcc 
tcataaaaca 
cttcgttggc 
attgttatga 
gtacatttct 
cttctttctt 
tggttttcct 
gagaacaaaa 
agaagtaaga 
atcaaactga 
caaatcttca 
gtgtgataat 
aaaacattat 
gtgaacttag 
aagcttgaat 
tatcatcttc 
ctttggcttc 
tggttctcca 
tcaatttcca 
gagaaatgca 
aacctggaca 
ataagtatgt 
ggtacagtaa 
gactgaaatg 
atatactatt 
gactctattt 
tgaactggga 
ggcatcagct 
agacctctca 
taaaataggg 
ggcagtgggg 
actgtctgta 
ttgccatgca 
ttttcttggt 
atactttcaa 
aaaatgatta 
acttagggta 
gattcaaaat 
gcttgcttct 
tagtaaaaca 
agtacatgaa 
ttaaaactgg 
attattacag 
aatgatccac 
agagctggct 
atcctcagtt 
atatagctga 
aatttatttt 
gaaactgagc 
ctattaactc 
actatgcctg 
ccatagggct 
ttaaaatatt 



aacttctctt 
atcaggggtt 
tgtaggaagc 
aaagaaaaaa 
tacagttttt 
attatgcttt 
aggtatattc 
tgacatgagg 
acctctaact 
tctgctatga 
cttattggta 
ttcctgctgc 
tacattttct 
ctccttgtca 
tctgaaatct 
gggaaacata 
cccaaagaac 
gcttctgagt 
ctgttaaata 
taaatttatg 
tctgggtgtg 
taaagcagac 
aaaacaaaaa 
tcctgccctt 
cctggttctc 
gcttgcagat 
ttttatttac 
ttgttaagtg 
gcatagacta 
ataacatcat 
aaatatggta 
tcattgtgta 
ggttcttttt 
caaaatttat 
tagtgtcctg 
tatagatatc 
gaagttgaga 
attagcagga 
ccccaagtgc 
gagaatctga 
tcataattct 
ttgaattcta 
gtttgcatat 
tgggaaacat 
tttcatggcc 
caagaatgat 
gtcattctat 
gagtgtcaac 
gtatttttac 
tatttgttat 
gattattact 
tctgggtgtt 
cagcttgctg 
gtccagatag 
caccattcaa 
tcataaataa 
tcctaaagtt 
aggcagtcta 
aattataacg 
caatttcata 
tttgtaaaag 



tctttctcgc 
gtcagttccc 
ttagttaagg 
aagtttgctt 
ataggtaaaa 
tgggtataga 
ctgcttctat 
tatgaaccaa 
tgtgccgctt 
ctttactttc 
tgctttatac 
tctcccacat 
cctgtgcttc 
catttccttt 
catagtttgg 
agagaaacca 
tcactgtttc 
gctgatcaaa 
ccataaagaa 
catcaacctg 
tccataagag 
ggctctcacc 
gaggaaggga 
ggatgtgagt 
agttctttgg 
ggcagatcat 
atatccagtt 
atttcatcat 
ctacacatct 
gataagtatg 
taatcttatg 
gtgcatgact 
ctctgaagag 
ggttttggtg 
ctgatgaatg 
aagagctcaa 
aactgctaaa 
ttctttaaat 
agggccatag 
aaaaataata 
aaatcatact 
aataattgtt 
ttttattacc 
tcgggtatac 
ctagtctttg 
aacacagtga 
ggaaccactg 
ttcatgctgg 
aagactaagt 
aaaactaaag 
gaaaaagaaa 
gaaaacacca 
ggtcacagca 
ataatcccat 
agcactatgc 
cccaaatatg 
taaatatcta 
actcaggtat 
tcatgctatc 
aaaggagaga 
tgccagagat 



cttcttcttc 
cagatggcac 
ccaaagttct 
gagaatatct 
tgagtgcata 
aatattcaga 
aattaaagca 
gttcaggacc 
tcttggtcag 
tcctgatagg 
actggttctc 
ttcccatctt 
cttcttccca 
gcccgctctc 
aggtaaactt 
agtcactatc 
agtgctttca 
acctataaaa 
tgaaaaggtc 
gctaggctaa 
tgttttcgga 
aataagggca 
aaatttgctt 
gggccttcag 
acttggactg 
gggacttctt 
atgcattgct 
tgtgcaaaca 
aggctacatg 
tatgtatcta 
ggatcaccat 
gtatacgcac 
ccctaataca 
aaacatatgt 
ggtgggaggc 
caagagcact 
agtttcttta 
actttcgaac 
gaagtacaag 
gagtcactga 
agtggtcaaa 
gtggtcacca 
tgttccttaa 
agtccctgac 
gggtaccaca 
ttgtgtagac 
agagttttta 
aatatttttg 
aacatcactg 
agcgaatcaa 
tgtacggaat 
ataagcctgc 
tgtaaggaaa 
agccccatgg 
taagtccttt 
tataccacta 
acccaagtta 
aataacatta 
taactaaaaa 
aaatactggg 
tgagtataac 



7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

8580 

8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 

9540 

9600 

9660 

9720 

9780 

9840 

9900 

9960 

10020 

10080 

10140 

10200 

10260 

10320 

10380 

10440 

10500 

10560 

10620 

10680 

10740 

10800 

10860 

10920 

10980 

11040 

11100 

11160 

11220 

11280 

11340 
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aagtgtgacc aaaaaaaaaa aaaaaaaaaa 
tctgagaaat agaaatatca gaggaaggaa 
agcattcaga ttccacagat tccacaaatc 
taacacttga cctaacattt cattgcagtt 
agtggtttgc tctaatttct tcaggaatag 
agaaaggaaa ctatcagaaa aatacagaat 
ggtaaaattt tatattgtaa tcttaagtat 
cagcatcaga atctgccgag ggcttgttaa 
tcactgaatc agaatataaa tactgacaag 
tggcgaagtt ccattgtagc ctgtgattgt 
caaggaagaa aacccaggcc ttagcttctg 
ctgcatgact catgagcaag gaaatcaaac 
atggagacta taataatcat ctcctaggct 
tcattcctct atttacagac ctgccgcaga 
ctgttttcta aacttagtaa ttgagtgtga 
gattatttat ttattttctc aaaacaacag 
attattttac atttctatat tctcctaccc 
cataaagcca ggtatacatt atggacaggt 
tttgtccaaa aattttaaat ttcaactgtt 
agtacagtag tattcagtac agtatttaag 
atgaagttac atgaaaagca aatttgtgtg 
tatgttcccc acaaattgaa atgcatttca 
cagagtgtgt gtgagagaga gacagagaga 
ctatgattaa taagaccaag ttttctaaag 
tgactatctt ttatcagggg caaaaaggaa 
aagtgatgaa aattcattca caatgattgc 
gcagcctgtt actatggctc tctggagtga 
gattatcctg acaagaatat actcagccaa 
gggaaaaatt caaaagaata aatattcttt 
catttcttcc ctgactaaca agcagcaagc 
aaacccacct gacttgttaa tatttttgtt 
ttctttgaat gatcttggca aatgtctcga 
aaaaaaagtt tcatgaatgg gtgtcaaaat 
acgtatggca ccctaaaact gtattagaaa 
cttaaaggac accctctttt acaaactcac 
tagctgttgt tattgctttc tatataccta 
tggcaggtgt ggtggctcac acctgtaatt 
tcacctgaga tcaggagttc aggaccagcc 
gaaaatacaa aaatcacccg ggtgtggtga 
gctgaggcag gagaatcact tgaactcagg 
ccactgcact ccagcctggg tgacagagca 
gactggtttt tcaacagcta ttcccacccc 
ttttcctagt ttgggtaatg gccctctggg 
tgcaaactat gtttagaagc atgtctggga 
gtaggctttg catgaatgga aaaggagagt 
cagtgagaga agcaaaagtg gggaagcaag 
caggtaggag gattggctca gtcatcttgg 
gtggactaga ttaatttcat ccattacagg 
cattgggatt tgatggtaga aggtattttg 
tgacattctt tgaattcctt tcaagcaaag 
aaaacataaa aagtattgtt gctctgctca 
ttccatgatg aaatgacata aatgaggttt 
caaggcaagg tagctacctg ggcagagctg 
ggttaattgg ccatggaagg cagtcattaa 
ttcccagctt ctgcatcctt ccctgtccct 
tcccatcagc ctcatgaagt tctctctcat 
gaaaatgtgt ccagtaatgc ctgattggcc 
gaagctaaac tgagaaatct tgcaaatcat 
atcatttatt gactagtatg tggcaggcag 
ggggggggcg gggttcacac atcttaaaga 
agtcaagaga gtaaggaatt gtctttgttt 



aaaaggaaga aggtaaaaaa aagagggagg 11400 
ataaaggagg gtgagagtaa attctctttt 114 60 
acatttcttt ttttaccaac taaggaaaaa 11520 
agctaaagga tgctagaaaa actatgttgc 11580 
agaaaagtga caaaaagatc agagaagaga 11640 
tggagtagga tataacatat ttgggttgaa 11700 
cttgctactt cagtttggtc cctggaacag 11760 
aaaggcagaa tctcaggtcc catcccagac 11820 
atgccccggg attcatatgc acagtagagc 11880 
tttctgcaac ttagtatttc tgagttttcc 11940 
gcagacttgt gtttctcctt tacttactag 12000 
tttatgtgcc tgagtttcct catctataaa 12060 
tgttttgagg atgttcaaca aatgctcctt 12120 
caattctgct agcagccttt gtgctattat 12180 
tctggagact aactctgaaa taaataagct 12240 
aatacgattt agcaaattac ttcttaagat 12300 
tgagttgatg tgtgagcaat atgtcacttt 12360 
aagtaaaaaa catattattt attctacgtt 12420 
gcgcgtgtgt tggtaatgta aaacaaactc 12480 
cccctgtact taaacatatt cctcgtacca 12540 
agatatcgta gatggaagta aattagtctt 12600 
aaaactctgt gtgtgtatgt gtgtgtgtga 12660 
tacgctttgg ttgcctccat aagctggctg 12720 
aaaatgagat cataacaaaa gccctcttta 12780 
agagacaaaa cagcatgaaa tgatgagacc 12840 
tttcaagagt aatttctctt gggtaattca 12900 
tagctaatgt aaatgaagcc tctaaaagtg 12960 
taatgcaaca gaaatccatt caaagcattc 13020 
tttttttttt aaagttaatg acctacgatc 13080 
acttaaaaat atccagccag gatgaaatag 1314 0 
tggtcccagg gactcagatt ctaagccaaa 13200 
attatttttg ccaacttttc tttatcttgg 13260 
tgattagttt taaaaacctt tcttgcagat 13320 
aaaagtaagt actctgtagt gtgaaaaatt 13380 
aaaaacagcc tttggaatac ccacatgaag 134 40 
catcttgtct attataaaaa gactggtttt 13500 
ccagcacttt gggaggccaa ggcgggcgga 13560 
tgatcaatat ggtgaaaccc agtctttact 13620 
cgggcgcctg tagtcccagc tactcgggta 13680 
agtcagaggt tgcagtgagc tgagatcatg 13740 
agactccatc tcaaaaaaaa aaaaaaaaaa 13800 
tctgcatgga aatattcacc cagtcaattg 13860 
caggactgga gtggggcaca caggagaagc 13920 
aatgtcatgc aagaaaagac atatttaaag 13980 
aattctatgt agagcagagc ctcttacttg 14040 
aggaattatg cttttcatca gccaaatttg 14100 
ctgaggctca tgaaaccagg tgtaaagaaa 14160 
aagaggagcc gtgaaagata atccagaaat 14220 
ggactattcc atttgaaatg agaaggtacc 14280 
gattaaattt acccatgagt tgactcagaa 14340 
gagttttatc taactcattc tcacttctta 14400 
tttattgttg ttgttgttgt tttctggaca 14460 
ttttatttct ctatgccgtg gagagaaatt 14520 
gatgttccca tgcgagtgaa ctttccaggg 14580 
caattccatt gttggtgatg acaatgtctc 14 640 
ttattaaaat ttgctttcag gaaaaatttt 14700 
ccttatccta aaggcttaaa ctggaggaag 147 60 
tgagccaaaa acgtattaat agcaagatct 14820 
tgccctttta tttaggcagg gagagttgat 14880 
ggtgctatct cctcctatat aaatcatgta 14940 
ggttatattc aggggattag agtatacagt 15000 
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agaagatccc 
cggtggctca 
caggagattg 
aaattagccg 
gagaatggtg 
ccagcctggg 
cagcagaaat 
tggttaagag 
cactgtgggg 
agttcgttgt 
ttataggctt 
ctgctccaaa 
ctctcaataa 
gcaaaaatgt 
tcaagaaata 
gcatgagtac 
atttccttaa 
ttgagaaaaa 
tccttcctcc 
tttctttctt 
tcctttcttt 
ctttcttcct 
tttcctttaa 
aggaaagtga 
tcttctttta 
cctttatgtg 
aatctgcgca 
cccagcatcc 
ttatcagagc 
agaatatctg 
ctatcgcacc 
atctcccttc 
cctctctaag 
ccaaccgcca 
gctgcttctg 
tacctttggg 
aaaataggat 
atttaatcaa 
aaggattttt 
ccattggtca 
tgcctgtttt 
ttattttgtg 
taaaatttct 
attaagacag 
aagagattct 
acagctctaa 
ttcagtgatc 
caggctttgt 
ggagccataa 
tctttattta 
ccctgacctg 
gaacaagaag 
ggtgttaatg 
aataatgacc 
gtatggtcca 
agataataaa 
gcacatagaa 
tctctatatt 
gagttaaaat 
ttaccccaaa 
gaggagggct 



aagaaacctt 
cgcctgtaat 
agaccgtcct 
ggcgtggtgg 
tgaactcagg 
cgacagaacg 
agagccaaat 
agcagactct 
ttctaattaa 
gccccaattt 
actgtgagca 
agagtttgtt 
ctggaaatga 
tcgtttcttt 
ttgttgaatg 
aaatctgggc 
ggaactgata 
cttattttct 
ctcccttcct 
tctttctttc 
cttcctttct 
ttctttcttt 
gcagaccatg 
agcacaatta 
cagtttcata 
tgtgacaagt 
gcagttcctg 
atctccttgc 
aaaggctctc 
cacggatgac 
aaggacagat 
caaaacacag 
gacctgaaaa 
aaatattaaa 
ctaggtgcaa 
attctaaatt 
tattccttta 
tcagaaattg 
cttttacaat 
gagttttact 
ctcttcacat 
gcctctgatt 
taacatatta 
aggtcttaac 
cccaggatca 
cgtatataca 
taagtcaggg 
ggaccctatg 
gcaaaatata 
caaaagcagg 
agaaaatctt 
ctctgtcaac 
ataagtgaaa 
atagactcct 
ggccaagtta 
agtattttcc 
tagggcccag 
tttattcatt 
tgtatgtatg 
tttgatggct 
tgtctgggaa 



gggatcattt 
cccagcactt 
ggctaacgtg 
cgggcgcctg 
aggcggagct 
agactccgtc 
catgaacata 
taactagaac 
cccctgttac 
gttcatctgt 
ttaaatgagt 
aaacacttaa 
tcaagcatag 
tgttcagccc 
aatcaatgaa 
aaagtgacct 
aattggtgtg 
ctccctcttt 
tcctccctcc 
tttctttctt 
ttgtgccttt 
ctgcctttct 
tctgttagat 
tcaagggtct 
tctccaggcc 
gaaaataagg 
ggcgtttcag 
tcatcttaca 
ccactattct 
ctgcctcacc 
ctcccagcaa 
ttctcttaat 
caactggcca 
ccattttggt 
aaatggaaaa 
tactcatatt 
gctcgataag 
atttttggaa 
ccagactata 
gaccaattcc 
accccaacag 
acatggtccc 
ataatatttt 
gttctctttt 
cagttaggta 
ccctttgtat 
gtcagcaacc 
gtctctatca 
aactaacgaa 
tggcagatca 
atatttatgg 
tgaaggtcaa 
tagttaaagt 
gaacaagaat 
cctaatctct 
tcagagagct 
cctatattaa 
atcataaaat 
tgaagggaaa 
taaagtaaca 
gacttgtctc 



tagactaaga 
tgagaggccg 
gtgaaaccct 
tagtcccagc 
tgcagtcagc 
tcagaacaaa 
agctaaacaa 
actgcactcc 
ttaccagtgg 
agaaggggta 
tactactgta 
gaactgattt 
gccctggaat 
tgtgcctaga 
cctaccgagg 
ccaaggaaat 
atacaatgta 
tttccttcct 
ctcccttcct 
tctttctttc 
ctttctttct 
ttctctttgt 
gaatgccttt 
ccagtcatct 
tttcattggg 
aaagaaaaaa 
ctgcttccca 
ccctgtgtgc 
ggttcacccc 
tgggagctca 
gaatgacaga 
tctcccaaga 
tttcagctat 
tggaatgata 
aaaaatactt 
ctcaaagaaa 
caaccagaag 
aactgtttcc 
gaaggaaatt 
cacctctgcc 
ttacaaatgg 
ctaaattttg 
gtttatgtgt 
tgaaagagaa 
acagagctgg 
aacatgtcac 
ttttctaaaa 
taactgttca 
tgtggctgtt 
gaactcactt 
acaacattta 
ggctggagtt 
tagaagatcc 
gtctggactt 
ccaggcctcc 
gtaagaataa 
tttatcaata 
gtttatctac 
ttattcctgt 
acattcattt 
tgccctatgt 



aatgccaata 
aggtgggcgg 
gtctctacta 
tactcgggag 
cgagattgcc 
acaaaaggaa 
atgttggcag 
atgtcctcac 
cagtcttaag 
ggatgacagt 
tttgtaaagt 
acttgcatct 
ataagcaggt 
tcaatatcta 
tagttacata 
ttccactttt 
aaaaaatgtg 
tccttcctcc 
tccttctctt 
tttctttctt 
ttctttcttc 
ttttctttct 
ttctagttaa 
ccacatgttc 
tcaggttggc 
actcaagtga 
catcacctgc 
atgacaggcc 
cctacttagc 
gaggagctca 
aaagactaac 
aaccagaatg 
ttaaatcaac 
acataactaa 
ctaatcaggt 
tatattcagt 
ttcttccttc 
tatgaagcta 
cacaacctgg 
ttacacctaa 
ttgttattat 
acctaatcac 
caatatctta 
tattaggatt 
attttagtcc 
gaattcagca 
aggaccaaat 
aatcaccatg 
ttatgggatt 
atgggccata 
gactgtgact 
ctgaaagcaa 
cagttataag 
ctggcttagg 
attttcttat 
actgagctaa 
aatgccagct 
agattggcat 
tactattgat 
tgcaaacaaa 
ggtatcagca 



ccgccgggcg 
atcacaaggt 
aaaatacaaa 
gcggaggcag 
ccaatgcact 
atgccaatac 
tgtagcctag 
tgtagaccct 
gcattcctta 
agtgtttact 
gcttaaaatg 
aaactgacag 
ctacatgaag 
gtgatcatgc 
aaagagttct 
agattctgtg 
cctatatgat 
ctcccttcct 
tcttcttttc 
ctcttctctt 
tctgtccttt 
ttccttcttt 
aaggttaaac 
ttaatcatta 
atttcgctgc 
agaaaatcag 
ctcatcaagc 
caccattcat 
cagatataca 
gattccatta 
tgcccccaaa 
tgactgctca 
tttaaaaaat 
cctgctgaca 
caaatcactc 
catagtgggg 
aaatcttgac 
tctctgcctg 
actttcacct 
cggaagttta 
taagcatctt 
aaaagattgg 
gcatgtatca 
cagagatatt 
aggtctgtct 
taaagggatc 
agtaatattt 
tagtgtaaaa 
tttttttaac 
gttctctgac 
tgccaagtaa 
agagctgtct 
aagcacaaag 
cactcttgtt 
cattaaatga 
cccatgtcaa 
acatattagt 
tgtaaggatg 
ctgcatcaca 
tttgaaattt 

gggggaggct 



15060 
15120 
15180 
15240 
15300 
15360 
15420 
15480 
15540 
15600 
15660 
15720 
15780 
15840 
15900 
15960 
16020 
16080 
16140 
16200 
16260 
16320 
16380 
16440 
16500 
16560 
16620 
16680 
16740 
16800 
16860 
16920 
16980 
17040 
17100 
17160 
17220 
17280 
17340 
17400 
17460 
17520 
17580 
17640 
17700 
17760
17820 
17880 
17940 
18000 
18060 
18120 
18180 
18240 
18300 
18360 
18420 
18480 
18540 
18600 
18660 
tgacggactg gcacatgccc ttccagaatg gcccactcgc atgcctgcca agttggtgct 18720
ggctcttggc tgggagctca gctggggctg agtgctaggg tccctgggag gttccttgtg 18780 
gcctgaactt cctcaccaca aggcggctgc ggtgcgagag tgagcatttc aagatagagc 18840 
caagatgaca ctgtattact gtgtaagacc cagcctggga attaatgtag cctcacttcc 18900 
atcccactct atttttaaaa agtgaattat taaggtcacc ccatattcaa ggggatagga 18960 
attagacttc atctgtatta agaaaaatgt ttttaaaaat tgtagacatg ttttaaaatt 19020 
ctaaagtcca cttactggct gcagattatt tatatataca tgcaagatac actcctacat 19080 
tctcttctta gaaggctcag ttgcaggtac agatgaagct cttcaagtga gatttcttat 19140 
gtatttatcc tctcaatctg aagacttgta aactaagaga caagttattt gcaacctaca 19200 
tacgcaatat tcaatggtaa agtatacata ggacagccac tacagacact cttgttttaa 19260 
atagaggaaa atgagagcac ataacagtca ttggctcata gcaactctga tatccagaca 19320 
gcaaacacaa gcaggtcttt ttttaggtct cagtcctact gcctggattc cctactgctc 19380 
ttgggtcttc cctccaggtt cttggttctt ggacctcttt tcatttaata ctatttctgt 19440 
tcctttaagt tcaagctggc aaaatatgat tgtacaattc tgtttaaaat tccaggactt 19500 
cctgtgattc ttattgggga atactccatt agacaagaat ctctttgaca taagccattc 19560 
tctacctgag atccctgtaa ggctgtgatg ggaccacata accttaaaat tattagaaga 19620 
ctcattgttt actgagagaa tatgcctagc atatgcttag atccttagag gaactctgtt 19680 
tcaaagggct tatgagacat taccttatat ctttctaagg tacaaacaaa aggtctttgg 19740 
cttttgagtt tgatctttga gctgacacct tttcttaatt tgagaatccc ctgctctatg 19800 
gagagactga caaagagaaa tagttttata tttgaatgta acatcttgga tctttaatag 19860 
attatcttaa aattttcctg aaaatgtaac agttcctttt tttaaaattc attctcccta 19920 
cacacttatt atatatgact aaaagaaact ccctggcatt ttcaacattc tggttagaat 19980 
ttttcttagc ccaatctacc agtttattag gtatatttgt attttccacg tcactgtagg 20040 
tgacaatctt gctaaacttt ctgccactac ataacaagga tccctttctc cagttttcag 20100 
taacaatcta ccaatagcct cctcaaggtc taccaagctt ctactaaaaa tctctttcag 20160 
ccccttccaa cttctgccca ccaactagtc ccagaagcaa tacccatgtt ttaggtttca 20220 
ttatagcagc atctagtttc gagttctcag aacctgtttc gatcatgttt cactgtttcc 20280 
aaaacatccc aaaacagtag cttaagacag taacacttat tttgctcaca ggctgtacgg 20340 
gaagcatggc tggagaggtc tcaggaaata tacaatcatg gtggaaggca aagaggaagg 20400
aggtatgtct tacatggcca gagcaggaga aagagaaggg gaaggtgcca cacacttttt 20460
agaaacccag atctcatgag aacttactca ctatcatgag aacagcaaag aagaaatctg 20520 
tctccatgat ccaatcacct cccaccaggc ccctcctccc acactgggga ttacaattcg 20580 
acatgagatt tgggcaggga cacaaatcca aaccataaca gttggcaacc cttttttaaa 20640 
gaaagtaatg acatcaactc cttggggatg tggattgggg gagaatattg gagaggatcc 20700 
aggggaagtg aagatatcaa gttctttata cataaataga tctatctttt tagggaagta 20760 
gaatatgtca tttaggatag ggaaagttga agatgttact ctattcagct ttagggaaac 20820 
tccaagatgt aacactatgt cctgtgaata ttagtcttgt ggaaagtgct ccattggaag 20880 
aagacagaaa atgtcctgct gcacagagat ccatatcttc atgccacgtc ttgcaaatgc 20940 
agtcagggag cttgctctaa attcgtgttg ccttttagga agttacagag catatttctt 21000 
tatatcttct ttatttttgc ccttgatagg agactgccag ggattcagtt attataattt 21060 
tacatttatt tatccttttc tgggagcatc tcataccttt tttactagag aggccaaggt 21120 
cacaattaag attagggaga caatacagag tcaagacacc tgaattttgc cacactactc 21180 
ttgggccact ttaattggtg tttctgggcc tcatttttat tactattatt attaattatt 21240 
ttatggtgtg tataggagct ggaagatagt gacttctatt agaatttatt cttataaaca 21300 
atgctgtatg gctcttggat taagaagtaa atagaaaaca aaaagggtct ttgaactgct 21360 
gtctcttcta ggtcttctca gagtgctact gggaattaga tgggattaga tgggtgaatt 21420 
tcctttttct tttgtttgag aggtctagat ttttgaagta tctaaaccat ctttagatct 21480 
aagttaatct ccaaaaagct ttattgcctg aaacattcca gttctaggaa gtttgcattt 21540
ctctgagtgt aagggtctgg tctgtaactc cttgagatca ggaatgtgtc tttttatccc 21600 
catagttcag cgcctaccct agtgtcttcc ctacaagtga tcattgaatg aagaaaggaa 21660 
taaattcctt cttctctaac atacatcaga gacctctgac ccttattggg tcccccttcc 21720 
ctattgatat acctctccat ctccagacat ccacatggcc tcctaggggt atgggattct 21780 
tcactcttct cctggtacaa gctccttgcc tgaaattttt ctgacccacc cccatttcta 21840 
accgtcacag gtttagattt tctgaagcaa acatacagac agagttcgag ttacaagaca 21900 
tttaatagga atcgactctt gtgaaaggaa gaggggaaga tgtgggactg ggctcagtga 21960 
gaagccaagc tgtgacacaa tgaaaacctc agcccaccct gcacagagct caggaatgag 22020 
tgtcacgtgt tagagctgac ctgggtcagg tcaacattgc tgggtgtttc tgcttctgcc 22080 
tcgttcagtt gtcagatgca agctgacctg ggaagggttt gactgggcaa gatggctgtc 22140 
tgcagatgag gtcaatcaga aggggctgac agccaaaagt ggtttgctca tctcactacc 22200 
cacagctggg tggcaagtcc ttcaagaggg agctggatga tgcttctcca cctctaccac 22260 
accaaacaaa gcatttcttt cttattcttt ttcttcctgt ctcccttccc cctcccccct 22320
ctcccctttt ctcctcctcc tcctcctcct ccttcttctt cttcttcctc ttttttctcc 22380
ttcctctttc tttagagaca gggtcttgct tggttgccca ggctggagtg cagtggcaag 22440 
gtcatagctc actgcagcct caacctcctg tgtttaagca atcctctcac cttggcctcc 22500 
caagtagctg ggactacagg catgtgctat catgcccgac taatttttta atttttacta 22560 
gagacagtct cactatgttg cctgggctgg agcatttcta ttcatgggtt ttgttcctct 22620 
ggagctgtgc tgttctcttt gataacatga atatggattt gtatttaccc ccttctggcc 22680 
atcttaacta atggcccaaa ggaaggccag tggtgaagat gtttctctac agggaagttt 22740
ctaagtgttt ctcagcctac atgtgtgagt gtattaatgg gaatactggg gataattgga 22800 
tagaatgtat gaaattagga ttggcctaaa ctatctggtc caaagtttat cctagtggcc 22860 
ctaataacat ctccagtgat ggtctttcct gaggttatag aatgagctat aggaaagagt 22920 
gatagtacag gaatgagagg attggatcgt cttttctctt tgttctattc ctctatgatt 22980 
atttgatttc ttttttttgt gatggagtct cactctgtcg attatttgat ttttgtttca 23040 
gattaaggcc agctttcttt gactgagcag acccacgtat atgggaaacg gccacacacc 23100 
aaattaaacc attcactgac acccgggcac acccattttc accactctgc tcttgcccgt 23160 
gccattgcct ttgcatccaa gggattcttc cttctgtggc aatctgtatg acatccttac 23220 
taagaaggcc tccctcagtg cccttggccc tctcacagcc ctttccatgg ccttctccac 23280 
tagactgtga ggaacccagt gtctggaaca gttcatggca ttgttgttag tgttgtcatt 23340 
ttgttgttaa gccttctgct gtcagtggtc atttcatatg tttgcatatt tacgccctgc 23400 
ctctttttgg tatgcagtgt ggaccctatt gcacggctac ttgtgtttca gtggctttgt 23460 
cctattccac cttctatgcc acttactctg aatgcctggc acattggtag gtgcccaata 23520 
ccaacttgga aagtgaatga atttgttttc tcattacctt ttgaaatgcc tatacagatg 23580 
ttcctccact tatgatggag ttcattctga aaagctcatg ataagtagaa gatattatgt 23640 
cagaaatgta ttttattgta ttttttattt tttaatcttt ctttctcttt ctttctttct 23700 
ttcattcatt cattctttct ttctttcttt ttctttcttt cctttcatct ttcttttagt 23760 
agagacaatg tctcactata ttgcctaggc tggtcttgaa ctcttgggct aaaagcaatt 23820 
ctcctgcctt gctctgccaa agggctgaga ttacaggcat caactactgc acctagttga 23880 
aaaatgcatt taatgctcca ataaacccat cataaagtca aaccatcagc agtcagtgac 23940 
tatctgtata ccttaggact aagttagaag ttggagatgg taaagtttct aaattagtac 24000 
aagcacagga gctcttattc ctgccatttt atgaaagtca tgtgagtatt ggtgaccatc 24060 
actaatgggg ctggtagaaa ctttaggacc agtctgtctt agtcagctgg gctgctacaa 24120 
taaaaaacca tggactgagt gggttaaaca atggacattt acttctcaca gttctggaag 24180 
atgggaagtc caagatcaag aagccaacat ggctggattc tgatgagggc tttttctgga 24240 
atgcagatga tcaccttctt gccttgtcct catgtggaag agagagcaaa ctctctagtc 24300 
tctctttctt ttcttaaagg acaataatcc cattatggag gcctcatgct catgacctca 24360 
tctaaaccta gttacttctt aaaggctcca actccaaata ccatcacact gaaggttagg 24420 
gtttccacat ataaacttcg gagggacaca aacattcggt tcataacaag tgcattctac 24480 
aggactccag ggagtgtgtt ggctaccctt catcccacac agctgcaaca atcctaatcc 24540 
actgatatca agtcaccata tatttgaaat tcactcagtt cccaagaaaa tatctgcttt 24600 
cataatcatt tctcagctaa ggaatgaaag ctatgataag aaagtctagc acttagtggg 24660 
aaacaaactt gaattaggcc aggacccctt aaatattgtc actatgaaat ccaagggcag 24720 
tatttacatg cttttccttc atgatgacct tagatttcgc ttctttatgc tgcaatcaac 24780 
tgaaaacatt ccactgttgg taatccttta tcttctaaca aatctaattt agtagatctt 24840 
gttactcctt atttttgaaa atattaaact aaacaataag tgatctttca gggaaattct 24900
gcaagggaag tggcttctag agaatgcctc ttctccaaaa ggcctagttc tatttctgat 24960
atctaatcag ttctgctccc tttaaataat tactgaaact agtttattgt ttaaatgacc 25020 
tcatttgaaa gtgggacttt cccctttgcc ttcttccctt ttaatttctg tttttaatta 25080 
atgtaagtgc agtcagaact gacggtgtca acctttcctg taactgtgat tgccacttgt 25140 
ggacattctg tactcataag agtgattctc acaagttctt taagtttttt ggaagtgcaa 25200 
acccaacagg aagaaagaat aggactccct gcattctatt ttgctctgca gtgaagatgc 25260 
attgttgagt aatttgcttt acattcagat atacccagat tcccatcttc ccaatccatg 25320 
tgacttggct aagttacaac tctttataag cctcaatgcc tgcaactgaa taagaaagtg 25380 
gtaaccatct catggggcat tgggtaaatg agataatact tatgaagtat gcagcacaca 25440 
gccaagagcg taacgtgttc aatcaatgat atccattatt attacatgtc aaatcacatc 25500 
tgacttctgt gattctactg gggaaaagga tgcaaaatcc tcttgccgtt ggggtgtcaa 25560 
atattttagt acttatttcc aaattcattg gaggggtctt tccctgaagt ttacagatgt 25620 
ggaatacacc attcaggcag gctttcttca gtagtgcagg tcatttcagg ggttctccac 25680 
attctagaat aatttttttt tttacccact acagacacat ttactggcaa ggcccatgat 25740
aacaaaatga aattaataca ggctttacag tcagagacct aggcgacctt gggtaattca 25800 
ttaatatctc tgagcctcta tttcctcagc tttgaaagag gcataagaat tgctctgaag 25860 
aataatcatg agtataaaat aagataaatg agataactat gtaatatgca ttagtttagc 25920 
accaagccat atacaaaggc acaataaaga gaatctaata tgtttccttt attcccactg 25980 
ctatgaactg aatgtgtgtg tcccccgccc accgcaaatt catatgatga agccctaatc 26040
cacattgtaa aggtatttgg ^99tggggca tttgtgaggg aactaggttt aaatgaactc 26100
atgaaggtgg agcccccatg ctgagatcaa tgcccttatt taacaaaaat cagaaacaac 26160
aacaacacaa aggaagacag catgaaatct ctctctcttt ctcttttccc ctctccctct 26220
gccttcatcc ccatcgccat gcacccgcaa aagaaaggcc atgtgaggac ataaccagag 26280
ccctcccaag accctgaccc tgctggcgct ctgatctcag acttccagcc tccaggacca 26340
tgaaaaataa atgttggttg ttcaagccac ccagtctatg gtatttcagt gtagcagttt 26400
gaactgagat acctattaac agatgttcct ttccttcttc tccctcattt ttatggcaca 26460
ttaccagtgt ttggtactaa atactagctg ttaattctcc cttagtagaa gacaatagag 26520
taatgctgag ttggtctcag ctgtgggaaa ccttttgccc agagagaaaa attaagtctc 26580
ctaacagaaa gctggcactt tcccttactg ggcatcagcg aaaacctgat tcccatgggt 26640
aaagtgagta attgctaaag ggaaacagat catgtgcatt tacatagtca gagatgtttt 26700
atgggattag gtgggtgatg aagagtaatc attataataa ttttacaacc cagacataaa 26760
agagaaagag cagagaaaga ccagttctct tgtgaaatat ttgccattta atatcttttt 26820
atctttgggc aagccaactt attttcttac aaataagcag taactagaca cagatattta 26880
agaggagaaa tggaccttag atgttctgtg gcataatttt cctttcacat aggggaaaac 26940
caggaccaga gcagcggatg aagatgccaa gatcaggcac tgccaattgt tttgactacc 27000
tctagggaga tcctctcatg actcacacat tgcctctctc ttgaaaatga gtaaagagtt 27060
gcccaaccaa attcttgaaa taaggtcagt tataaatgcc tatagcagct accttctttt 27120
ttattattta gaaaatgatc attggaactt gttatcagaa gacacaggct gagagggaag 27180
ggatgtattt tttcacatgt caaggaacca ggggtaatat aacattcaca ttctagagaa 27240
aggtcaaata ataaattgcc ataaaataca gactgctcat gagctgtttt gaaatcttgg 27300
cttcttgtca aaaaactaaa ggtggaggag agtaaaatcc ataactgtgt gctttttgtg 27360
taggtgtgtt ggactgaatg tacatatgat tggatcatag agatgatcaa agaagtgtat 27420
actattcctc aacagaagag gacaaaaaac gaaattgaca tttgttgaca tttccttggc 27480
cttagaagtg tttgtaaatg acagagttag gatttgaacc cagttccagc gagtttaaag 27540
cccaggttat ttcccaataa gccaaagaaa tttccaatgc caacctattc ctggcatctt 27600
tagcagcaag agtgattata gtttccacag acagaaattg tgtacagagt tgggtgaaag 27660
aacagtagct ccaacatgct tggcatcaag gctgtgactg cacgaactat tccaggaaaa 27720
ctagtcatca caggtgattg aaaagagaag atatctttta gcagactaaa atctgcaggc 27780
atcttcatca ctccttaatg cagacagaca aaactgattg gctagactaa taagatcaga 27840
ctttgttttg gaaactaaat gtctattaag acacgcgaga agacaagatt tcctaaacat 27900
agactatgaa tagactatga atagtcaaga gggaggtcga ggaaaaggga attcccctta 27960
acatggcaga tcaaatattt aaattttaat gacatattag atggaatata agaagttttt 28020
gtgttatgaa ataaaaagaa ctcattcttt taaaaaatgg caatccaata tttatacacg 28080
ttcttctatc ttatagcatg actgaattcg aagtgttctt gttcatgtct ccttctctga 28140
gaaagagtaa gtgcctccac aaacacattc actgggagtc cagaagtgga gagtatcgtg 28200
ttttcactgt tcaaggtcac gtagtggttc ctggcccaaa caagtaggag ctttctaaaa 28260
ttaaaggcaa ctatccataa aggttccttc atctgctgtc tcctagtcca ttggcaggac 28320
ttggcaggga aagccctctt cttgatagag atattggacc tccaacctat ctcttcatct 28380
tctcccatac atctcctcca tacagggttg tcaggtcagt aatccctact ggccatttag 28440
aaggtgacat ttatcaatta taattaggtg acttaggagg cttactcaac tccccagaat 28500
ctcagtgtcc tcatctgtga aactggaaat aataaagcct atttcataaa cctaattctt 28560
catattaaat gggataacat atataaagca cttagcacat gctatgaccc cagaagatac 28620
agattcacag agcagatgga gcactgctgt gtctcatgat actcaagctt gaatatcaag 28680
cttgactccc attctgctaa ttctccctct ggctactagg acatcttgac tttcagggtc 28740
tggactctgg gccttaaggg aaagctaaga ctcgtccttg tatttctact ttatgtaggt 28800
agaagctgat ctgcattagg gctggttgct cccttctccc catctcaagc cccagaatct 28860
cttcttgtat taatctctct cccaaagttc ttgctgaggt cagatttcat agtcctgctt 28920
tttcagcacc tgttcttaat tagttcaaac ttttgcaggt tttctttcag gccatatttt 28980
atttgtttcc tcagaaactc ctcttagact acatccttta tctccagaca cctgggatcc 29040
acattgattc ttttattttc cacaatttct aaccctgaac agctgcatga atgtaagcaa 29100
gcttctaagc atagatgggt gggtggttgg gatgtgtaca cgttattcat tcattcagtc 29160
atacatttat tatttataaa ctacttattg aacacttcct agtgcttgga atttttttaa 29220
aaatctcaag cgatatgcat atcaatacaa aataatttca gttctcaaga aacttaatct 29280
tgggaaggat aaacaaagac ttataataca atatggtgca agatttacaa taaagcagtg 29340
gttctcagca ttggctgaat aatagaacca cctggctata ttatatagtg atgcttgact 29400
ccaactcaga ccaatgaaat caaaatccct gtgagtaaga cctgggcatt gctatgtttt 29460
gatagctttc caggtgattc cattctgcag ggaggtttga gaatcccttg gatagaggga 29520
agtatgggct gccttgggac cacaattaag aggtccctac tctaattggt aaggtgctaa 29580
aggctgcaaa gggaagatga catctgagct aagtcttttt tttttttttt tttttttttt 29640
tttgagacag agtctcgctc tgtcacccag gctgaagtgt agtggcataa tctcggctca 29700
ctgcaacctc cgcctcctgg gttcaagcaa ttctcctgcc tcagctgccc cagtagctgg 29760
gataacatgc acccgccact atgcttggct aatgttttgt atttttagta gagatggggt 29820
ttcaccatgt tgaccaggct ggctcaaact cctgacctca agtaatctac ccaccttggc 29880
ctcccaaagt gctgggatta caggcatgag ccaccacacc cggccctgag ctaagtcttg 29940
acaaatgagt tgcccaagtg aagatgtggg aaagccattt caagggcaga ggaacaagat 30000
gagttcatag aactgcctgt aattttgggt acagtttaga gagtgctaaa tgctaaagca 30060
tttagcctgt ctctggcagg caaaaggaga gtcattgaag gattgttaaa aaggaaataa 30120
tatggtcagt attatataca tctgaagtac caagacatca gttggggagc aaattttact 30180
ttcattgcat cttgccccag cagaactatc tccctttaat aagaccttta aaactgttgg 30240
cacctccctg gcttcaaaca ttagaaacag gaaactctac ccatgagcta accttcaccc 30300
catcctttcc tgaaacaaag aacatctgtt ctcccacttg gatgtgcctg gagtaaagag 30360
gggcgcctaa gtattctcag ttgactacag agaaatgatt ggcgtcacgt agattagtca 30420
tacatacatg cagatgatga catagtcaaa tgtattgaga cattttagtt ccaaaaagta 30480
agtgttttgt ttctgataca gtgctgtcac ccttggccag ggaaacccag ggatcatagc 30540
tgctacagat aaattatgat ttattccttg gaagagggcc tgaactcaaa agaaattaag 30600
tagtaataac acttgaaaat tctctttctc caatttcagt atgcaaatct atttacatat 30660
caaccaagcc tttgggacag gttagttatt gtttgtggtg gtagtggttg tgttgtttat 30720
gtttgagttt tgtgtttttc tttaattctg agatacatta tcaatttaat gacaactttg 30780
caggataaaa aaggaggagg agaaagaagg ggaggaagag aaaggggaag aggaggagaa 30840
gaagggaaaa aagagaaaaa atgattaaat ttaatttgtg ttctaattgt gaaacaatct 30900
aatttcaaaa atgttaaaat ctgtgtgttg ggttagagat gggaaaagta tgctgtataa 30960
taaaaaaaaa atccccagga ggaaaatgaa atatttttag aggtactagg aattttatag 31020
tcaatgggat ccaaagctgt atttttagac agagaaatct gaggcacagg tagacgagaa 31080
attttcttca aattctcgag ggaattactt agtagtgaag ctggaatcag aaacctcatt 31140
tctcagcctc aagtttggtt tgctttccat gacaccctag aatattccaa ctaatatctg 31200
agaagcaaaa aagcatctca tcctttctac tcatatttct ggctgaggga actattcaaa 31260
gagataaagt acatgaataa ataaaaaatg tgtatcacag ctaagtacat cttagcaaag 31320
aaataagcaa aagactatgg tgcctcagtt ctcaacactc tcttttgttc atttatttag 31380
tggctgttat cttgagtagg tgattcttgc atatcaaatt gcattttggt gtgtcaaagt 31440
gactggtttt acttgtttgt tggttcctaa agctaattat tatttttaac accttttgat 31500
atgctaaatc aggctaatga gactaaaatg ggatttctca tgggtttatt agtcatcaac 31560
accatcttac atatgtgtat atatgtttta atgctttcaa agtgtaaata aatgattata 31620
cttgagactc atacaaatat cataaagcag ctttttcctt agtattcctc ctagaaaata 31680
gatatttcat tggcagagtc gtggtgaaag atgtggtgtc atacaataaa tttacttcag 31740
ccagctgtgc gagcttaagt tacccagttc ttttctgcct cagtggcctc ttctgtaata 31800
tggactgata ggggatctca catgcctgtg tttagaatta agtcggataa ttcatgtaaa 31860
gtgcttagca agtcacctgg cctattataa ctagccatta ttgtttgcta gaacatttat 31920
ctgttcacaa attatgcatt gattgcctat aataactaaa cttgaggaat gcttattacg 31980
taccacgtac tattcgaagc acatcacaag tagtaactca tttcattttt ataatgactc 32040
tttgaggtaa gaactattat taccatctct gctttacaaa tgaggaaatt gaggcataga 32100
gacgtaaagt cacttgttca aagcctagca tctgggaaac agtagaggtg ggactcaaac 32160
tcagaaagcc tggctatagg ctctgctctg ctgtctcccg ggattggcca catgctaagc 32220
attggggcaa cagcaatgag gcccataggg tttctgctct caaggagatc actataaatc 32280
agccccggga gatgatccct aaaagagaag tacgtcgttt gcaaaatgac agcacagacg 32340
cacgagttct gaactcttcc tcagagtttg gggaagttgt ttagccctgc tttctgccct 32400
ctctatgggt cacacacata ctccgagtga agaatgacgg agttacccat cttctggctc 32460
cttcccgtga cactggctac cctatgctaa aaaccagccc ccagattcca catccctgaa 32520
tgtcttattt acaggttctg tcattttcat gataaccttt ctaatgttct aaatttcctt 32580
tctgtgggtt gtcaagcaaa ttcattctcg gcagtcagta agaactcctt gttcaaaatg 32640
ccacacatta cagagtcaat gattctaaac acattttttt ttagtgcatc caacttataa 32700
gctgcagtgc tagaaacaaa ttcacactcc ctttacacta cagtgccatc ttttccgagg 32760
gtggggctcg cgtttcttca ccatccagtt tctcatacgt agattatgat ttctgctttt 32820
gccaaacagg acattgtttg accaagtcct tatgggagga agcacagccc agtgagcaga 32880
acaatgattt gaaagtcctg ggttctattt tcagctcctc tgttgactag ctgcgcattc 32940
agccctagga aggtcaaaag cagaatggcc ttgctttgac gactttctct gcatgtacag 33000
ctctagatgg aaaatatatc aggcatctgg ctctcagcct tgaggagctg gtctttggag 33060
aatacatgtg tagtcacact ttgttttcat ttttaaagca aaagaacaaa acaaactctc 33120
tagggggaaa atgggctggt agagaatgga agagagaaac ctaataacat tttaaacaat 33180
ccaggagtta agggattgct tgagtcattc agatcaaaga aggctctttg gcacaagttg 33240
gagtctccgg catttgaaaa tggaagaggc aaagaggagg aagctttgag agactttttc 33300
aaaccgggac ggtagcagat actcagaact ctgcctgtag ggaccaggcg gggaactcag 33360
gcaggttaag gcctgtgggc aaactggaaa gcccaggcca cctgtgaagt ggtctgctgc 33420
tagtgatacc agatgcttgt tgtggaagaa ggaagttact cttgccagaa cagctgaatt 33480
ttcaagagaa actgaacatt gagatttttg tcattagaac ccctgatttt tttacatgat 33540
gaaatctaat catcattttt aaacttttaa aggggcactg agcaggcaag atcaaacata 33600
tagcctaagc aatgtactcc ccatgtctct tctatatgta gatggggtgt ggttccttaa 33660
gggctttagt tgtatgttct gtgaatgaat agaggctaga ttccagacaa aaatgattag 33720
cagtgactca actttagctg cactttggaa tacctgggtg ttacctggga ctgcagcctg 33780
agcactggga ttttataaat gcttctcaag tgattccagc atgcaaccat gaataagaac 33840
cactatgtta ggatattgcg agaaggttta agcattacag aaagttgggt tttaacttga 33900
taagatacat aatcttatca agacacatgt gatctctgaa catggatcag ggaattgaca 33960
ttcaatataa atggcatttt cttttctttt tttagactct tttgagagtt ccagttaaaa 34020
actctctagc atcattgctg atggccgtaa ttaatatctt tgggttcatt agcaatcagc 34080
taacaacaga catccattgg caaaacaagg gtatagtggt agaggtcaaa cacaacgaaa 34140
gccaaaggac aagtgcaagc cctgagcaca gataagtatt tacagggaaa attaactttt 34200
tttctgatgg ccatgaagca aaatagacag tagagcaaag ggagcctttt acaagagcag 34260
ttcaatcttt gtccacattc taatggctat aaaactcacc atgaagctag caaatggtgc 34320
atacaactaa tactttcagt cttttcttcc ttatttcaca ggtcttatgc attataaata 34380
acacgactaa gtctcagcgg ttgtatccat ggctagttct tatttcagct ttggttcctc 34440
ttgctattta ctatgtttct cataaaagat gttagtgtgg attgagtata tgaaatcatg 34500
tacaatggaa tgttaaatgg ggcattccct cttgtctagg tatatcagaa gaggcccaat 34560
acacttggtg ccaaggaagc tcaccttcta aaaccagatg gtagcttgcc tataaaagga 34620
tacactaaga ggttctaccc aagttatcac attggcagcc tccagcttat caagagagac 34680
tttaaagaac ttggcagtgg ggatctttct agagggcgtt gtcaatttag gaagcatctt 34740
cgtcttcatg cctcaccttt ccgataagtt tgtttcttcc ttaatgcaaa caaaatgaga 34800
gacactttga tgggaccact tgcagagctt gacatgtgtc atctggattt taacagacag 34860
atatcattga gatatctgac tcacacctga aaacactgta actagaacta ctctctctct 34920
ctctctctct ttctctctct ctctctctct ctctctctct ctctctttat gcctaaaaca 34980
gagctatgtg gaggctaagc aggacttcca aaatttgaaa caaataagga acctgttcaa 35040
agaaggcaga ttgaggcatc ttgaaagaaa ttgcctggca gggtgtcatg actccattag 35100
ggggaacgta ccatttgaag ccaagagaat aaagaagtca tggatgacta gcttgagaag 35160
gcgtaactgg agtcagggaa acagttactg agagagatgt cctgcaatgt agggaaaaag 35220
agggagcttc cccaaactca acacaatgct ctatgaaaca cacatcagtt gggatgttca 35280
ctaaactcag aggatcacaa tgacagatta taagagcgcc agtcaagtta gaacttctta 35340
ttccatctta cgaccttcct cttcctcccc tacactcctc tgtattcttc ctcttgctcc 35400
cccacagcct ctatactctg agaagcagtc tcttgctctg ctccttctgt gaattgttaa 35460
ttcaattatt tcaaatttct cacttcattc tctgtctagt gggtgtgagt cctgggtgtc 35520
tggggatctc atatgtgatt tattgtggtc agaatgcaaa aatataggca aaatcataac 35580
atagtagtta aatttataat tatttggatt attgaaatac atctataaaa agtattttat 35640
aatttgtttt gaatactttg tctttggaga ggccagaaat cacagacagg aaaggaggag 35700
gagatagaga gaggagaaga tggggaaacc aacctggcct ctcctcctac tcctggattt 35760
ccaatctgat aacagagttg tgagaaaaga cagagcttaa gtttacatgg ataagacttt 35820
tgactagact ttttactacc ttggaaaaaa gcttttttaa aaaatttatt acctgttggt 35880
aatcagaaaa gccaagagag aactactcag agttttatta gttagaggaa cctggggatt 35940
gagggtgagt actcaaaatg tcaggtttgg aggggcagaa gcaaaattta aaaaattgtt 36000
tatacctgca acctgtcaag tccagtttac acaataatac aaaataaacc aatttaactt 36060
gaacacataa aatgttttca tgatgtggat tcaatttaac aagggcacaa catgatacca 36120
cagtcatctt tcaaaattct catctttagg attcagcttt ggtaccttaa agatatattg 36180
ggttacagca acatttctgc aggaagctaa tatgtattcc ttaaaaagag ttccatgatc 36240
aaagatattg gtgacaaact tgttcactca aaaaataaat agaggccagc catggtggct 36300
catgcctgta atcccagaac tttggcagga caaggcagga ggatcacttg agccccagag 36360
ttggaaacca gcctagacaa tatagcaaga ccctgtctct aaaaaagaaa tagctgaaag 36420
tggtggcatg tgcctgtagt cccagctact tgggaggctg aggtgtgatg atcatctgag 36480
tccaggaatt taaagctgct gtgaactatg aaggctccat tgcaccccag cctgggtgac 36540
agagcaagac cctacctcaa aaagtaaaat aaataaataa gtaagtaggg tttttttttg 36600
ttgttttttg ttttgttttg tttttaactg taggtctcct agaatgtttt aatgtgtgtt 36660
gtaagggaac aattctataa gaaagaatac ctggtggtgg cattttcaag ctcatttgtt 36720
caaaggctgt tattattccc aaatatttag gcttccattt attaaatatt tgctgaacat 36780
ctaccatgtg caggcatggg ggatgggtgt tggggtttaa ttaaatatat caaatgggct 36840
cctctcactc aaagatctta tgatctaatg gattgcttat tttagtcctc atcttagttt 36900
gaaaatctct ttggagagat ctgttctatg ccatttttca gatttctgcc actgcaaaac 36960
tatttgttaa cacaaaaata caaattacct gaggaagaag ttgtgttgta aaatagatgt 37020
gtttttataa aacaaaattt aaaagtgcat tcacaaccaa gcataacaaa tatgctttgg 37080
tcacctttcc tttctgtcct tgatcaactt cagctgatta tggaaaattc ttgataaagt 37140
tataaagtca tccacctctt ttttctctag gtctaactcc tttccttgta ttggacgtca 37200
aatccccaca ttctactctt tctgctccct cagaaaagat ctgacatcag tcattcggat 37260
tggtgtgggc ttcctgtgag atattttgac aaaggttttc tttatgtgtt agcactgaac 37320
agttttggca attttcttcc ccttcaccag cctactacac atgaccacac aggtgtttac 37380
atatgtacat acacacacgt gcacacttac cttccttcaa aacaggccaa atgcaacatc 37440
ttgtctcctt cttgcaggca aagtaggaga cctgtgttga gtgacagaat tcctttacct 37500
tctgggctat agcatcccag agaatgtcct cacctgggag ttttctatgc atggctttcc 37560
ttgattccca ggaagatcta catccagtta acttaactat ggcagctgaa tcctgacaat 37620
ggtgtgaagg aaaacatggt ggatacaaat gagcaccatg ggtacagctt gatgatgtcc 37680
agctttatgc ataatgtgga ctggcagggc ctttgatcat gcctcaatag ctgtgaagat 37740
agcagatggc aaaagaagtg gtaggttcta agtaggttcc ccttctctac tggttgccac 37800
agactcaaaa ggaattttta ctacaggaag ggacccaaat attctagcct agcctctata 37860
ctctgaaaag cagtctcttg ctctactccc tctgtgaatt gttaattcaa ttctttcaaa 37920
tttctcaatg gattctctgt ctagtgggtg tgggtcctgg gtgtctgggg atctcataca 37980
tgatttcttg cggtcaaaat acaaaaatgt aggcaaaatt ataacatagt agttgaattt 38040
ataattattt ggattattga agtacatcca taaaaagtat tttataagtt gttttaaggc 38100
cttgacctgt tcaacatatt tcttaaatgt atctcaattc cttctacatt ttgctatcat 38160
acgtatcgcc atttatgtct aggctaccat catctaagac cacaataata ggtaccaact 38220
tatcttccca cttttgtcct ctgcaattga aaaataaatc agggttttta cttctgacca 38280
aaatggaggt ctcaggtgtt tgactaaagg tagtatagca ctgtgatgct aaaataaggg 38340
aaagcaatga cgtgacattg agtgattgtg agtggccaac catccccatt ctcctagttt 38400
gcttgggact gagggagttc ccagaacatg agactttcag tgctaccact gaaaatgtcc 38460
tggccaaacc aggataagtt gttcatctga aagccctgtg tttttaccaa ttgacagctt 38520
gtagaaagtt tccagtctgt agtacaggaa ggagagactc ttatatagct tgccaggctt 38580
gagttgagga gatagaagca aaaatttaga gagacaagga ggctagaatt tgaagggtga 38640
gtagtaaaag gaagaaagct gcacaaagag ggagacagta gactacagat gtgcagaagg 38700
gtccctttca gttaattgag ttctgatctc accattcatg agagatgtct tagtctgttc 38760
tgtgtttcta taatggaata gctgacactg ggtaatttat caagaaaaga ggtttatttc 38820
aatcatgctt ctggaggttg ggaagtccac aaagcatagc accagaatcc acttggcttc 38880
tcgtgagggc ctgtgctgca atagaacatg acaagaagta gaaggggaag cagacagaga 38940
tcaaacacaa gaggcacccc actctcctaa gaactaatcc atccctgtga gagctaattc 39000
aatcttgtga gagcaagatg gcacttctca caatcttgtg agagtaagaa cccactgacc 39060
accgtgagaa tcgcaccaag cttcccatga aggcagaacc ctcatgaccc aaatgtctcc 39120
cattagattc cacatcttaa agttccatct cctggcattg ctgcactggg aattaaggtt 39180
ccaacatgag ttgtggagga gacactcaaa ccgtagcaag aggcaactat ctgagactgg 39240
ggaaagaaat atcagaaaaa agtaagtgaa aaacccatgg accctcacac agggctgcaa 39300
ttagtttgtg atcccaaaag aaggaaagga aattgtcata atccaataaa cattgggtag 39360
aatattcaga ggagccctgt gttagtaatg aggataaatt agccttaggc taaagactat 39420
tctggaccca tcctaacaaa gcttacagga taggatccaa ttgcttccaa ataactgtgt 39480
ttcagaaaaa caaaaaacaa acaaaaaaaa actaacatta cttaaagaaa tataacaaaa 39540
tccataaatc aaggagcaag tcaagaaata gaaaaagaat cagaaatgat ggagatgctg 39600
gaactaacag acaaaaatct aaaactatta taaatattcc ctacatgtcc agtaaggtag 39660
aggaaaagat gagcataata aagagaaaaa ttgaagatat taaaaaatac tcaaatggaa 39720
cttctagaga aaacaattat aaatgagtga tcaaaaaata aactacaggg ttttatagaa 39780
aattggacac tgtaaaagaa atattggtga acttgaacat aaagcaaaag aaaatgaccc 39840
caaacaaaca ctgagagtaa aaaaaaaaaa gacaaaaaac ccccagaaaa tcagtgagat 39900
atgggaaaat ataattgtta taacctacaa gtaattgaag ttcaggagga gaagtggaga 39960
atggaatatt ttgaaataat gatgactaaa agttccaaaa tttgatgaaa atgataaccc 40020
aaagactcaa gaatttccat gaacctcaag cagaagaaag gcttttaagc aataaaggaa 40080
gctaagaaaa atcatagtaa aattggcagg agagtgggaa gaacagtgaa aaaaacattt 40140
tttaagagca accagagaaa aaagataggt tacatggcaa ggaacaaagg tgagaataat 40200
gccatccttt ttgtcagaaa ctatttaagc caaataataa taggcatctt taaaatataa 40260
aaatgaaaaa aaagttaatc aagaattgtt tatacaatga aaatatcttt caaaaatcaa 40320
gtcaaagaca gattttcaga caacaaaagg aggagatgac tcataatcac caagtgtgca 40380
cttaaacaaa tattaaaata aattatctag gcagatgaaa aatgataaca ggtggaaaat 40440
ttgggcctac acaaaggaat gaagagagaa atgaataata tgtagataaa tgtagaacag 40500
attttttatt cttttccagc ctctttaaga gatgaataac tctttaaagt taaaaacaaa 40560
aaagaacaaa ttctggagtt tgtgtcatac atagaagtaa aaagcatggc aacctagcac 40620
aaaaggttaa ataggaaatg gatggatact gatgcaaggt ccatatacca tgagtgaagt 40680
cgtataatat tacatgaagg tgaactgtga taaatttaaa atgcatatgt actgtaaacc 40740 
tcagaagaac cactaaaaat agttatgtct aacaagacaa tagtaaaaat aaaataaagt 40800 
aataaaaata actaataaaa ataataacag ggagagagga ataaaaaaca aagagaagac 40860
aagtaaatag aaaacaaata accaatatag aagatttaaa accaacagta tcaatagcta 40920 
cactgcatgt taacagtcta gacacttgaa ttaaatggca gatactggcg ggattggatt 40980 
aaaaggcaat acccaactat atactgcctg aaatatagta cttaccaagt ggatttttaa 41040 
aattaataat aaaagaatgg caaaagatat accatgcaaa cactaaccaa aagaaagctg 41100 
gagtaactat attaatatat gacatcttat agttccaata tttatagttg gaaaattaac 41160 
acctcaatct cagtaattga tagcaaaagt agacagaaaa atcagtgaag atacagaaga 41220 
cttgaataac actgtcaatc aacttgacct aattaacatt tatagaacac tccatacaac 41280 
aagaggaaaa tctacattct ttccaaggtc attcagtaag atagacaata tcctgggtca 41340 
taaaacaagg ctcaatcaat ttaagagaat taatgtaata caaagtatat tctctgacca 41400 
caggagaatt aaactagaac tcaataacgc aaatatatct gcagaaaact ccaaatattt 41460 
ggaaaccaaa ccataggtca acgaggaaat cacaaatgat atcagaaaag attttccctg 41520 
aatgaaaaca aaacacaata tatcaaaaat tgtggaatgc aactaaagca gtgcttagag 41580 
aatttatagc attaaatatt tatgttagaa aagaaggctt tcttaaatca atgatctcag 41640 
cttccacctt agaaaactag aaaaagaaga ggatattaag caaaagttaa aaataaaaat 41700 
aaaagctacc tttattgaac acttaccagg cattatctta agtgatttat gctaagtgtc 41760 
ttttccaatc tggtcataac attatgagtt agtcatgatt aatatcccac atttcagatg 41820 
aggaaattaa atcttgctga acttctgtaa cttccaagtt tatgtagtta accaagagcc 41880 
agagttatta tttgaactca ggtctctatg ctgttgtaac ttcaacatcc tacagattga 41940 
gtgtatttat accgttaagc atattttatt atcttcaggt ttgatatttc tttgttgcat 42000 
atatattata catatatata aatactatat acacatacat acacgaattc ttcatttacc 42060 
tagaaccaaa aggaccttct cctagcatgc tggagaatgt ctagagacca gacattgcta 42120
ttatcatctc tgaaaactgt ggcatcagtg aaatgaccta ccatgaagga tgtcactggg 42180 
aagtctgtga tggcagtcca tgaataattt gcacttgttc ctgccagtct ttcaagctac 42240 
atgatctcga gtcatggaga tatgggtgag caactgagta tcacagtgct ctccatggcc 42300 
accataactc actcaagtct taatatcttc tcgcaaggat aaacccagcc tccttaaaac 42360 
aaaatcaagc tcatgttctt tacatgagaa gccaaattca tctttctcca agcacaaagt 42420 
agtaaagagt gctggacatc tgatttctat ttgtagaact gccaggcaat tattgtgtgt 42480 
cttgggcaag ggacttgatt ttctaaatcc tcagttttct ctgttgcaaa atggaagcaa 42540 
aaagatctct ccaatgtcct ctccttgttt gctgctttct gcatcttttc tcctacacaa 42600 
ttgacaattt taactgcctt atcggtataa ctactgaaga atcagttccc aatatccatg 42660 
aaggtaaaag tcagatatta ttattcatcc actccaaatc ctccaatgac ttcccatttc 42720 
actaaaataa aaatgtaagt ccttacgatg tcctcacaac atagtttctc ccatccctct 42780 
gtacagccta ttatttcttt ggctttatct attattcctt ccctctgtat tagtcagctg 42840 
ggttcccgtg atgaaatacc actgactgag gagcttcaac aacacaaatg tatttattta 42900 
tttattttat tttattttat tttattttat ttttgagaca aagtctcacc ctgttgccca 42960 
aactggagtg cagtggcaaa atcttggctc actacaacct ccacctccca ggttcaagca 43020 
attctcctac ctcagcctcc cgagtagctg ggactacagg catgcaccac catgcccggc 43080
taatttttgt atttttagta gagatggagt ttctgtatgt tggccaggct ggtcttgaac 43140 
tcctgatctc atgatctgcc cacctcagcc tcccaaagtg ctgggattac aggcgtgagc 43200 
caccatgcct ggccacaaat ttattttctc acagtcctag aggctggaag tctaaaatca 43260 
aggtgtcagc agatttcatt tcttctgacg cctctctcct tgacttgtag atgactgcct 43320 
tcttgatgtg tccttacatg gtctttcctc tgtgcacgca catacctggt atctctgtgc 43380
atgcccaaat tcccttttct tataaggaca tcagtcagat tggatgaggg cccaacctga 43440
aggcctcatt ttaacttaat cacatcgtaa aaggccctat ctccaaggac tcttagatat 43500
attagaggtt agggcttcaa tgtgtgaatt tgataagggg agggacataa ttcagcccat 43560 
aatactctca cttactgtgt tccagctaaa ctggtcttct gcatggtctt caaacacact 43620
cttgtctcaa ggcttttgca ctggctattc cttctccctg tgccgctgca aagaacgatc 43680 
atgtttgccg caatcttgta gttttaagga ccttttcata acctaatgca catgtcaacc 43740 
catggccaga accctatgta agatttatcc ccaatttttt ttttggcctc cttgctcctc 43800 
tcttttaatt tgttcccttt ggcttcctta agtattaact ctaacagata acagctttac 43860
agtttggctt cccttgtttt attcaactca gtttcaggtt attataatgt cttcatttcc 43920 
gatgtggccg ccatattgga tggtgctgaa tcacccattc aaaagaaatg tgttaatgcc 43980 
agtttatgtg ccagtcatcg ttagtatctg gaaatacaca ggaaaatatt agggttagga 44040 
ttaatgaaga ctaatgaatt cagtagtggt atacacagag tatcatcaga gcccaaaaaa 44100 
ggaacattta tatgaaagaa tatcaggaag gcgtctggaa agagttggta ctacagctac 44160 
agtttgaaaa atgagcaact tagccaggta tacagagggg aagagaattt tctagactct 44220 
gttagcatca tgcatgaaag tatggaagca tgaaacagta tgatacattt acagaaaagc 44280 
gcatgtagag aagtatgaga taatggggct tgattgatag aaggctagag aataaaaatc 44340
atgtgtgcta agataattta tcttgaaagg tccagggagc attttaaaat tcttaacaag 44400 
ggaatcattt ggtcggatct ccaatttaga acgtttagta ttccatcatt gaagacagtg 44460
acatgtaaga gtgagttctg gatgaaggaa gccaggtagg gagttactgt agtaattctt 44520 
ctatgaagaa atgaggacta gaacaagagc agaggcaaca caaatataat ggaaggtacg 44580 
gctatatgac acagtaagga aattgaatct atagaattgg tgctgggttt aaacatgaga 44640 
attgaacgag aagaaaactc taaaataaat cccaggtttc tgggtttggc aaagaaagag 44700 
atgatgatat tattcatcat tagaagggag agaaacatta acttgcagac acattgagag 44760 
tagtgaaact tccgtaagag ctgtccagtg aacaggtatt tttatacatt ttgtcaaggg 44820
caagaaaatt tcccattgat tggggcctat ttagttgcct tcttcttctt gcaagagtac 44880 
tatttttctc aacacacctt gtaatcttag gcaaagggaa gaaggctttt atttggagtt 44940
gcttttatat gagaccaagt ataactaaaa agatatagca tatcctcaac cccacccagc 45000 
ccccattcat tgcagcacct cctaaggtgt agatgtcccc tgtatcccaa aaacgtgagt 45060 
ggtgactaag tgtgatgtgg cacaaacgtt gaccacacta aagtgaccac atagtacttc 45120 
gctgctccca ctgggaaaca aaacataata caaatataca atgcttcaca gagtagcagg 45180 
tgattctttt tttttttctt aaagtcagat ttattgaggt atagtttaca ctaaacagtt 45240 
tactttttgt aacttttaag ttcagaggca tcttttgacc tcaaaaacct agttaagtaa 45300 
gcaaaggaga actttttttt tgttttcatt tcactgatga ggaagctgat gattaaagag 45360 
gttaaatgac ttgtcccggg tcatacttag tcttgagtaa gcaagattaa gtcttctgac 45420 
tcttaattct gctttttcca agatatcaca gaggatcttc cagaacagct tacttagaaa 45480 
ggacatataa tctgttcctc ataacagttt tatggtaata ggaatttaat cttctaagta 45540 
atcaactcag gcttttccac ttaccagtga aattgcagga ggaaactata gaacttgaaa 45600 
cactatcttc cacatttctg tattgatcct gtttgatggt ttgtgatatt aagggaatac 45660 
atgactatca tctagtgttt caagaccaag cccttcaaag tagagtaagc cacaacataa 45720 
aaagtgctta gttaccttct tggctctaga gagtcattaa atccaacctt tctcttctgt 45780 
ctagctgcct aggagtggct tctgttttaa tcatcatcat cattatttgt attagtctgt 45840 
tctcgcattg ctataaagaa ctacccgaga ctgggtaatt tacaaagaaa agagatttaa 45900 
ttgactcaca gttccacagg ctgtacaggg agcatggata tggaggcatc aggaaacaca 45960 
atcatggtgg aaggtgaagg ggaagtaggc acatcttcac atggcagagc aagagagaca 46020
gagtgaaggg gaaggtaaca cttttaaaca accagatctc agccgggcgt ggtggctcac 46080
acttgcaatc ctagcacttt gggaggccaa ggtgagtaga ttgcctgagc tcaggagtct 46140
gagaccagcc tgggcagcat ggtaaaaact cgtgtctact aaaatacaaa aaattagccg 46200 
agagcggtgg cacatacctg ttatcccagc tactcgggat gctgaggcac aagaattgct 46260
tgaacccagg aggcagaggt tgcagtgagt tgacattgtg ctactgcact ccagcctggg 46320
taacagggtg agactctctg tttccaaaat atatatacac acacacacac atacacacac 46380 
acacacacac acacacatac atatatttat ttatttattt atttatttat ttatttgtgt 46440
gtgtgtgtgt gtgtgtgtgt gtgtgtatgt gtatctgtat gtgtatgtgt atatatataa 46500
acaaccagat ctcgtgagaa ctctataatg agaccgcacc agggagatgg tgctaaacta 46560 
ttagaaacca cccccatgat tgaatcacct cccgccaggt cccacctcca acaccgcagg 46620
ttacaattca acatgagatt tgagtgggga cacagagcca aaccatacca ctattttatc 46680
atttctatta agagttccct gcatttcaat ctggcctaga atacctgctg agatcatatt 46740
cttatgggta ttaagtgata gtgtcataaa ttaccatctg caaaatacta tttcagaagc 46800 
tgtaattctc tgtcaagttc tctgttatat tgctacactc tgcagatttt tagaaagtta 46860 
aatattatat atgtatactt ctgcattaat gactttattt tgattcttgt ggtatctcag 46920
gaatgtaata tgcaaagttt gtatagatgt ttttctttag tgtaatgaat tattaggata 46980 
tgactctttt gtaaaaggga gaaagaacag taatttgaat ccctgcagag atagcaatgt 47040 
atttggagac aatttgaaat gttttatctt tatctactaa aaattccagg ttgaaaagaa 47100 
taaaatcacc tcctctctct aaatgcttta ctctggtagt attattatgt aattattcac 47160 
aaaaaggtgt gttcttgtta cacaatcact accttctcct tggagatttc aaaaataaat 47220 
agatgagaaa ttatgtccct caaaaaagac cacaaaagtt tatccagaaa actttccctg 47280
tattgatttt tctttaataa atgaagctac atgttttcat ttttaagaaa tataaaacct 47340 
taacttgttc ttctctcttg atacagttta ggaggtcttg catgcattct ctaacctcac 47400 
ctccttcccc tttctccaac tacactgtcc tctttcagtt cactgaacac tccaagctca 47460
ttcatgcctc agggcctttg catatccaga tatctccttc tgctcactcc ttattaccac 47520 
catggtccat ttgaactgga tcctcccagt tttttctcta tcacttcacc ctattcattt 47580 
ccttattaca atttacaaaa accaaaaagt gttacttctt atctctcttg ccagctctat 47640 
cagagctgct accacatctg tgttggatgt catgtataac cagcggctag ttccatgcct 47700 
ggtacatcat tgcttaatat ctttttgttg aatgaatgaa tgaatggatg aatgacaaaa 47760 
attcctaatc aaattttctg ttgctacctt ttatacttac aacttaatat tacatcctta 47820 
tactctaaac tcatacatac ttctataaat aggagaaact ttacctacag aaatagcaca 47880 
ctaagattgt aagcaggaga aagataactg aagtttttcg aatttagaat ctattctggt 47940 
cactttctaa tgagttccag aatctatgtt cccatatctt tgggaagcaa ataattgact 48000
gttaatagct tttcaaatat gtcatcagaa tggcagatga ggcctgaaat tctcaatgga 48060
ttgacaaaaa gcccctgtcc tcttggtaga atgaaatata gtacttgtaa cacttcctta 48120
cttcccccaa attacttatt ctttaagtta cagaaaagga attccacaaa ctacaatgaa 48180
aactcaaagg aagtaattac tttaggtaat atacaagtta atactatgtc ttttgtttgg 48240
tttttcttca gacagaaaat aacccaaact ttttttttaa ttcattcttg aattttagaa 48300
accattgtta aagtattaat catttctgag ccatccgtag attgagttta ttcatccttt 48360
ctttaagttc ctctcaacaa acttagccat gttttatgat gtataaaaca tgctaaaata 48420
tttttaatct aaaccaaaat ggattttggt caggtatgct gaccctgagc agtcagtata 48480
tgacagggag accaaaacaa cccaaccagt tctgatttta cacaagttcc ttgccttctc 48540
ctttaacata tatcggaatt gctaccttag atggttcagt gttatgggaa gttgacagac 48600
agaaaatgca gtattatatt aggtcaaagg ttgacaaact tttcctgtaa agtgaatatt 48660
tcagactttg tgagacatgt actcgctgtc acaatgacac aactaagcca tcttagaggg 48720
aaagcaatca cagacgatat ataggtgaat atgcataagc aggtgcctta cttcaggctg 48780
ctgtcacaaa gtaccataga ctgggtggct tacaaacata tttctcactg ttctggaagc 48840
tgaggagtcc taggtcagag tgtctctctt ctcagttaca gactgctgtc ttcttgtatc 48900
ctcacatggc aagaagagga caagagagct ctctggagtc cctttcatga ggacattaat 48960
cccattcatg agggctaaaa cctcataccc taatcacctc ccaaaggctc taccacctaa 49020
taagatcaca ctgggtgcta aaatttcaac atatgaattt tgggaagaac acaaacattc 49080
agtccactgc agtgtgctat aataaaactt gatttacaaa agtacactgt aggccagaca 49140
tggcttgcag accatagtcc gacaaccttt gaattaggtg atcactacaa catgtaaatg 49200
aatatgcatt gtgtcttagt tcaggctgct ataacaaagt accatagaca tggtggttta 49260
caaatatgtt tctcacagtt ctggaaacta gggagtccta agtcagagag tccctcttcc 49320
cagatataga ctattgtctt tttcttgtat cctcacatgg caaaaagagg ataagagcac 49380
gatacctaga tgtatttgtg atggggagtc agccatttta acatttatta aacacccact 49440
aaactccaag gtttccagac agtgtcataa ataagcttca caactgatct gtattcagta 49500
gatgtgaaaa ttgaaactgc ctggttatta atctgtgtgg ccaggccatg aacccattct 49560
cttccgtcag gttcagccct taagcacaca tgccacagag tcaaaaacaa cctaggtttg 49620
aatcttgctc agctatttac tgatggtgtg ctgtatggaa aattactaga cctctgtgta 49680
tctcagaatc ttcccctatc tcatggagtg gctaggagga ttaagaaata aaccacaaat 49740
agcactcgct tagaacagtg ctctctgctc aacgcatgtg gctagctttg accacaattc 49800
tgcagacact tcaggttcca gagagttctt tcagaagttc cacaaatgtt tgatttaaat 49860
gctgctaaaa aatatttttc actacttaaa ataccttaat aaaaagaaac atatattcat 49920
gcaatgtata ccaaattgaa cacatattct cattctatat gttaacttgt gttgtcagtt 49980
aacttttaca cagacattga agcaaagctt ttcctgttct tcctgtttat ttaaaaattt 50040
tgcagagaat tcaaggtaag ctgataaaaa taattcttac cattgtaacc acttagctgt 50100
gtgaatcaga attgtcttga cactgtgcag ctgaagtaaa aacaatctaa aattggaggc 50160
tgaggtggac ataagattct catctctaat attgtgttaa tcagaaaaac tttattattc 50220
ccattgattg gctttacaat aagtaaatgt tataaatcgg ttttaaagcc tttatatgaa 50280
aatgactcac tttttctgtt ttatatattt aaggtttcat gtaaattttc atttggagct 50340
aaaaaagaca tttttcagca gaaacattta actagtactt aactagtcaa gatctgcctc 50400
tttagattgg ggcttttttt agatcaagac ttgattctta cgtatctttt ttaaaaatta 50460
gactctgata attaggtcaa gacaaagagt tacccaagca gagttcttca ttctccttta 50520
ctctcctacc cacatcatta tgtcctagtg gccttaatag atattttctc cttagatctt 50580
tataataatc ctctgagttt gccaggcagg tgatattagt ctcataaatg aagaaggtga 50640
atgagactca aagaatataa atgactccca agggcatatg actaaacaag tggtagggtc 50700
aagactcaaa tcagaaggct tctgactact aatcgagagt ttttcccacc cgttttgatg 50760
tcaaagatgg ctgagatttt ggggaggggc tttgggagtt cagaggtgac cctttagcca 50820
ctgctgaagc tgatagagat gcaagatcag taaacatatc ttattctttg tttgtttttc 50880
cataggttgg ctaggctttc tagtgattaa aaaaagcaat gtagacactt gtaaagtgtt 50940
ttaatttatt gtcctggact ctgtctcact tgttgcaaga gtcatttgac atgacgcctt 51000
gctaagaaat ccacacgtag gtctcaaatt cagtcctggt tgatttccta gggtctatgc 51060
tccatgaggg cagggacagt actagatatt tgtgtccaca aacatctaga ggcatccaat 51120
tcttatctgt tcataaatat atgatagaat caaccatcat tctatacatt tttctttaga 51180
tcttttttcc ttttgctacc attgcttttt ggaccattat tgcaggcata tcataaactt 51240
ccttggtaac tcttggcatt ttggtcctgc ctccctcaga tttttttctg cataatctca 51300
aagcatccag caatgcttct ccagcgcctc tcagaacaag gcccagatta tcctcctggg 51360
attcaagttt cttcatgata tgggcatgcc ttatgaatac accttcgctc tggtctcatc 51420
accgtccctg cagtcctctc tctatatggg acatgctcat tgctgctttg tgtctctgcc 51480
gttgcttgtc tgtttgccag ctttcatttc tctacacaga tccaaattgt acccatgatt 51540
tcctacctaa ttcaactcta attttctatg atccctgccc cagtaatttt agcccataag 51600
aatttttata gcacaaattt tgtacttaat gacacttgta ttatttcatg ggttagttag 51660
tgcaatctct atgaggctgt aggcaatcaa gtaagatatt taaattgtac ccacattgtc 51720 
ctaccttccc acactgtaga cacctactaa attggagtaa tgtgtaataa aataatttat 51780 
agatttaagt aattgaaatc atcccaaggc acagagagaa caaacaaaca tattcctgag 51840 
acatgtctaa ttcatgtgcc atggttatct tgagataatt ttccactgta cttttaataa 51900 
aagagacacc taaattattt catagtttgc ttttgttttt cctccttgag cattgccatg 51960 
gttattgata tttattaaat caaatctgcc actgaagcat atataaccac tcaagaatga 52020 
attacagcaa atatttaact gcctgcctgc ctttgaattg gttaatcaga atgagtgatg 52080 
cattagctgg ttcaactcct ttcacctgga aagatggcta tcgattccct ttattgtgat 52140 
ttaaaagatt tgaggaaagt ctaaaagcat ttcaaaagat ttcaataagt aagacataat 52200 
tctgcttttc ataaccacta aatgaaaaaa aaaactaaat acaaaaaaga ttcttcgagt 52260 
aaaattctgt cattctccac tttttattgt gttaaaaaaa tcaactataa tatgagtgaa 52320 
agtcaagttc tgctggtatg tatgaaatga gaagagaggt gctgtaggtt attaataaca 52380 
ggggatgttt tgtacctgtt ctggaagatg aatatgcatt tttgtgttgg agccagtcca 52440 
taagtcaaat tcctttaaaa caggcagagg ccagccttgt tctggctgca gcctgcacgg 52500 
ccagaaaaac agaggaagaa gcgccttcca ccagcatgaa atgttatcca ctttgatatg 52560 
taaacattct tccttcacct gaagttgatg tcagggtaac taaaacacaa gaacttggtt 52620 
gactgagcag aagcaatctg ttgtattctt ctgtttttca aaaccagaga taatcaagga 52680 
ctgttatgtc agatgctagc tgacctaact tttgagtgga aaacgaacca ggaatctgag 52740 
ataactggat agtctccgca tatttttaaa aaatcagcat gtctcttgaa actgctcaga 52800 
cttagctgat ttttgtagtt gttgttgttc tctttatagt tttttttctt ttttctctgt 52860 
attccagact cctggcattt aatctattgg gtgggaataa atgacgtaaa ttgtactcat 52920 
atttgcaaaa agcccaaaaa agatgttaag gctgttaaca acagacgtct acgctacagt 52980 
tgaaatgggc taggtgaatt gagacagaat tctatgaaca ttttatgaag ctttccttaa 53040 
atttctaatt gaaaattgct actgtttgtg ttactgttga ttcatttcat ttaagtcttg 53100 
gtcttgtaga ttctattgat accccattct tggggatgaa aaacaatgtt aaatgcgcac 53160 
atttgttagg aaaatgggct gggtgggaga cggaaggctt tggccgccat cctagggcat 53220 
tcctgtgcat tgcgatggtc agcatttctg tatctaaaca tcgaaaaagt acaataaaaa 53280 
tacagtatta taatctggga ccactatcat acacgcagtc tttcttgacc aaaacatcct 53340 
tatgccgtgc gtggctgtgc tgagaaggag gagcactcgt cctctccaag cacatccaat 53400 
ttgcctcagt cctcatgact cccgccctgt taccagaact gagctcattt gttattgaag 53460
ttgcttttct tatacccagc ccagttttcc attgatgctt ccgcttattc tatgtaggta 53520 
ttttcattta tcacacaata tccctagtct aagacttcat ttgaggaaag aaaaaaaaaa 53580 
aaacctctgc tgctaaaaat aatttgaaaa ctcctagaca ataggacctt tcaaatccct 53640 
tacaattcta acaaccagtg aattcatgct tcacagccat tcagatctat acgtatgaaa 53700 
tacaagacat atgtgttcaa ctttccttca tcgcctcctt taaactttct tttttctttt 53760 
ttgagtcagt ccctgcagga acagaccaag tgagcccatt aagagaacag ttcattccca 53820 
cctgctacct cgagccagcc aacaatctca gcctcaccac tccattcttg attatgtaga 53880 
accattccaa tcctgagtga caggatttgc aaagacagct gttgggctcc acaaacattt 53940 
ggaggagggg aaaatgtgag gttggttgct acattctcct acttcttttt actgaaagga 54000 
acagctgcga tcttcacatg taagatgaag taaacaaaag ctgacaatgc ccagcaccat 54060 
tcagcagtaa gcattcaaat ggatttttcc tcaccgcgtt tgcaggcccg gcacagtctc 54120 
agttccccac agcacagaga ggaagcaagg tctttgcatg ccctgttcac ttggccaaac 54180 
attgccctag atgtgcaatt caacccatat aaagcaagcg taattcaaga gccctgactc 54240 
tttcaccatt tcatgatggc ttattactcg cacacggcat tgtcaggttc cagcaactct 54300 
gtgtgcagca ggcagaaatg cagtgtttat ctctgacagg caaacattct taggctcatc 54360 
ggcatgagta tccacgctga actggctatc ggggaagtag atccctgcat gggaaagtca 54420 
aggtatgttt agagggactc gatgattcaa ccccattggc tggttcattc cttttgtttc 54480 
ctctgccttt ttatttgctt acacagaaaa cagagcattc atttatttct tttccctctc 54540 
tcatctctat taaccactag ggccatctgc ccatctgaaa ctgtctttgg ctgcatggtt 54600 
gtgaatccag gtgacagtga aaatgaagaa gtggtgattg tccctgcatg ttttctctga 54660 
agggttaact aaaaaaaaaa aaaaaaaaag cctttgtgtc tgtttcgagt tctccagttg 54720 
aaatactgta ctcattgaac tgcaagaaat aaaacctgca agtaggcact gtgctaatgt 54780 
aatagaaatg tgttagacag gggcactatt tcttaaatat gcttgaatta agcaagttct 54840 
tttaaaaagt gggtgggact tttgatgttt tacatagtta tattttcaat ttcttgtgct 54900 
cagtttaaaa ctcaagattc tttgtttctg tgtgtgtcat tagggcaggt gttggatgac 54960 
aagtaaatgc actgaaaaat gaaaaaaaaa gatttgtgcc ctaacctcca tgaatttggt 55020 
gacttaatat atccatacag atctgacttt taatcaaagg tgctcacagg tggggtaagg 55080 
gataaggagg aatttgtcta caggatcctt ttatcctaag ttgccttaca acatccaatt 55140 
gtgttaccgt tttttttttt ttttttacaa acccattgct aggctaagaa gtattgattc 55200 
catctagtta aacatgtcga gtgagaggaa gtgatcaatg gattacttct tcaaaacatt 55260 
tatcccccct agtggaatcc tctgattttt ataacattaa agagccaggt ataggagttg 55320
ggagaagaat ttaaaatttg aaatgtcttc ttttttcaac tctgttcatg aaaactctag 55380 
ttaatgcatg tgatttcctg ggtctccctc tctacttccc tccattatca ccaccttttg 55440 
ttgctgctgg ggctgtgggt gccctgaagg agcatgagat ggccttcctc agatatctgg 55500 
gttttaaatt cagttcgtat ctcttaagtg aagtgggtag ggccaaaaat gactcactgc 55560 
ccagtggtgc ctggaaacat tatcagcaaa gacattcacc tagggagaaa ataaacacag 55620 
aattctcata tgggcccatc tttccaacca tctcctggac tttggtagtc ttttctgagc 55680 
cagatatctc tctaaagtca cttgctgtca gacctttttg tctcatattt aggaatggct 55740 
gaagttcttt ataaatccca gatgaaacag gaggaaaact actttgaaat gatcatttcc 55800 
atgatctcta aaatgcaaca aaccgtattt cttaagatta acggatgcga gcctatgaca 55860 
ggttcaatgt gggaccaact caagggaaat tgatgaccga attattaaaa ggcattcatt 55920 
acatcgcagt ttcacctgaa gagggggtgt cccccagtag tgcttctgca gcagaattgc 55980 
ttagaaactg cagaacatcc acaaggcaag caccccaggc cattgtgtct gactgtcttt 56040 
caaatagagg aatggctagt tgccatttgc tgagggcatc tcctgcactg agaactgtgt 56100 
gcatatatgg atgatatcgc cgagactcac aagtatgcaa gatagaaact aggtgcattt 56160 
ttttcagatt aggaacttga gaatggaaga taaactgaga aggggagcag gggaggatag 56220 
ggagagattg ggcaacggtt aagaagttac agttaaatag gaataaatcc tggtgctcta 56280 
ttgcacagca gggtgcctgt ggttgacaac attgcattgt ctatttcaaa gtagatagaa 56340 
gagaggcttt tgaatgttct caccacaagg atatgataaa tgtttgaggt aatggatatg 56400 
ctaaatactc tggtttgatt attacatagg gtatgccagt ttcaaagcat cacattttac 56460 
cccataaata tgtacaacta tcatgtgtca attaaaaata aaattaaaaa aaatttattg 56520 
ataaaagaat aaaaaatagc atgcttttgg ccactctatt tgataggcat catttaaaaa 56580 
acataagact ataaggctgg gcatggtggc acatgcacct gtactcccag ctatttagga 56640 
ggccgagaca agagggctgc ttgagcccag gagttcgaga tcagcctgag caacacagtg 56700 
agaccctcat ctttaaaaaa tatacgtagt atagatgatg taaaaggacc acctttccaa 56760
atgagtcaat ttcttcctta atttaataca ttcctaggta gtaagttttt agagagctaa 56820 
actaattcct tgctatgtct taggttacca tttataagtg ttcttatggg ggttagcttc 56880 
attttatgta ctaaacactt atttgaaagg atcgttgcct ttttccatgt gctaatttgt 56940 
atagagagtt tacgtattgt gggaatttaa tttattcctg cattttatcc aggaaaatag 57000 
agccagatgc ttttggacac ttgttatttg ctttaaataa gatatcaggt taagataaga 57060 
aaaatatatt gattggatat tgcgtcttcc tctggagaac gtaaaacctt tcacgtccta 57120 
catctcatca taccttacaa ctttagaaca gactctaata aaaggaaaat gttttaactg 57180 
tccctttact tatagagggc agggatggga aagtgcactc aattatacta acagagctct 57240 
caattaatct aaattcaaag cgaaggctcc tgttgtggat cagggaccct gaataaacat 57300 
gctttctagg agacttcacc gccaggttcc aagagaggtc acagtactcc tgcactcctg 57360 
atggcagacc cttggattcc attgcaggca gtgacacacc aagagtaaga ggagctggct 57420 
tcagccctgt ccacgtgtgt ctgctggctc gcccttctgt ctggcatctc cacgcccctc 57480
ctcttccgga ctcccttgca tgaccttcca aacagaatga gctgttcttc caaatatgac 57540 
cctaactcat tcaacagctg gaggcaagag caaactagag ctcatgcaaa gggtattagc 57600 
attgagtctt ttttttcccc ctctttctct cagactaata gatatttgga aaattcatct 57660 
aaaaaccttt tttaaaaaaa tttcattcat aaaaatagcc aaggggtttc tcagagagtt 57720 
tccttttagc aagtcttttg gaaggaataa atgtctactt tattattttt ccttcctgtg 57780 
cctgacctcg gagaggatta gagggtctgg gcaaggagag gagcctcaca cccagccacc 57840 
cacacttttc cttctgacga gacccaccgc catgccatgc tctgtacaag ttctctggag 57900 
gaacataaga gccaccttat gtgtgaggga ataagaccat gctttgaaga gccagacttt 57960 
ctccttttac ttgctcttga aattgttttc actctgatct gcaagtcaaa gaaatgatct 58020 
attatacatg tagggacctc aaaatctgtt tgactctgtg caaataaggt acagtccctg 58080 
ccttcttcct atagttttct tatcactagg ggacttgaaa aggatttagt tcaacaactg 58140 
agtcagcatg agctcacaca aaacaaatcg tttcacacca atgtcaattt ctttctttct 58200 
tcccccccgc ccccactttt ttatttaaat aattcggtca gggaaatccc acatgcttag 58260 
tatcattttg tcagcaaggt acatgacaac tctggtcaca atagaaagtc gtttagtgaa 58320 
ttaatgtcaa ctctggaaaa agtggctaat gtgtcttggg gagtttggtc cttgcctgta 58380 
cacctgcctg aagatcacaa gagtaagaag gacaggtaac acacactgaa tgacaggatc 58440
agaggttagc aatatctcag aagcctgcca tatgagccaa acccaataca aaacaattta 58500 
gcaatattaa gtaattatca cattttgtac ttcaataaaa aatttaatat catgactttg 58560 
cggtaggaaa ggccaagttt aatgttggtt aaaaatagtt tggagattat atttgtctgt 58620 
cttcaatatg atgtttcttt caaaatatcc caaataatct gaattaagag tagaaataat 58680 
atctaccatt tattgagtac ctactatctg ctaaaataac aaatttataa tgaaggtgta 58740 
actattccca ttttacagat gaaaaaaatg gaggaggtta agtaaccagc actaagtcaa 58800 
aatggctagc aaagatcaaa accaacatgt tagccaatgt taggcagttc aagtcctttc 58860 
ctctgtgcct gaaatgaaag aaattttcca ggaattttgc attggttatt catatcaagt 58920 
ataaaaaatg ttattcattt caatgaacca aaattaagtc aagacactta aggactgtct 58980
agaggaaaat gaccaggata ggaaagagtc tggaagccat gcaagaaaag acacagatgg 59040
taaatattta acatgaaaaa tagaagattt gtgtgggata taggagatac cttcaaataa 59100
tgaagacctg tggtcaactg gggcaaatat tttaacccca aataagttct tcaacaataa 59160
cagtttcatt acaaataagg gaagagaaag tgcccaaaat gagcattctc agtcactaga 59220
tgcagaatca gaaagccacc tgccagggct gttattgagt gaaattctga ctttgaaaac 59280
atttggccta tacgagggat tcttaatctg atatccatgg aagaacttta ggtaaagtct 59340
ttgaatccct ggaaattata tgaaaaattg tatatgtgtt tttttttctt gggaaatggt 59400
ggatatattt cttcaaatcc tcaaagcaat tctacaatat tggtggggca gtgaaaggaa 59460
atcacttcca attctacaat tctgttttgt aaaagtagtc ttacccttca cagaggtgca 59520
cccaagagaa ctgtgtgatt gatttaatat taggggtgaa tgaacataaa caattgaaga 59580
aaacttcaag gtttttagcc ttgtaactaa gatgttcata caatttttaa aaatctttaa 59640
taaaagagaa gcaaatttag tggaagataa ttaagtagaa ataaataggc aaactgaagc 59700
ctgtctaaga ttccagtact gagagtactg agtacgtaat tcaaatattc ttttacatca 59760
tggcagaaaa atgctgaaat ctagggatat atgctgacta gtaactgagt cttatataaa 59820
taattttttt aaagccactt gaactttaga accataaaat gttataacac taactgttct 59880
agagactact tacttcagtc tcctaatgtt acaactgaag aaatcatgtc tcagaaaaac 59940
taagagacat attctggatt ctggggcctg gttcctgatt cacgtctctg tctgcaaatt 60000
taaggaaaaa aaattctaaa acattcaaat ctggaataaa cctatgcatt tttcaagaag 60060
aaaatatata tataaaaact agtcttaact acatataata cacaaaggtc cactttgtta 60120
tggtataagg caagaccgta aggaggtctt atgcctcctt atgaggcata agattgcccc 60180
ttacgagttt taaagtatga attccacacg tcagcatgct ctgggatgtt ttcacagtat 60240
ttttaacatt ggtcctcata gtcttaattt acctctggaa atttccatgg atgattaaac 60300
tatgaagaac aaagaagcac atggctgtgt tatttagagt ttagtgaatt aggtctgcaa 60360
gtctgttggt ttccttttag atgttgtctt gtccgtttct aagatcagag ggagaccagc 60420
tccaggagtg ctcatgaatg catcagtggt cttccatacc atctgatgaa aagattcatt 60480
ctgtgtctca aaaggatatt ccagatccag ttctgagtca gcaggtcttt ggtgtatata 60540
tatatatata tatatatata taaatttttc cactgcttta tttttttaaa ttcatctgac 60600
aaggtcaagg tgggattgag acaagaaaaa tttggaacta gtcccatttg agacttattt 60660
ccttaaaaaa ggaactgttt tttaataatc tgaaagttta cataattttg acatgcaagt 60720
tatatacgtg tgtgtgtacg tttgcatgct tgtgtgtatg ttcatttttt ttaaagagac 60780
tcaaactaaa taaaattttg attgggggtg ttgctgaaag gactatagtg tcagctgtta 60840
taacttcagc tttaaacact aaaccaaaac atgggctaca tgttccaatt catactgaag 60900
ctttataata tatggcctta gagtagaagt ataatgatct cttgggagag gcaagatcca 60960
attaaaattc agtttgctgt cttttggaaa gacataagca gttatatatc ctttgagacg 61020
gtcttaagaa atacacacct gagctttaaa tggccactca tttgactgtg aagttgaatt 61080
tcaattcatc tgggagtcag agcacaagaa agaattcaaa taacaacact ctgatatttc 61140
ctggtgtgtt accagggaga aaagagctcc tggaaaatga acgccttaca catttgcaga 61200
tattaccaac cacttataaa aacaaggcta ttgttatgca taaaagtcat tccttttagg 61260
ggaagcctaa gagtgaattg aaatgtggct gacatttcta ccacaagata aaatgttttt 61320
ttaatatcat gtttaagttc tcttagttaa aaaagagaaa agggaaaaaa agaagaagaa 61380
agaaaaatat ctataatcca gtattcagaa gcaattccag acacttcacc catgaaataa 61440
ctcactggaa gacattacat ttctaaacac aaaagctatt agcagccttt tctaattctc 61500
ttttagttca ataagggaat tattgaccgt attttgcaat cccacatgtt tttaaaagac 61560
aaaaaacata gtacattgaa cagaaaacat acgaatgctt tctatttaat ttattagttc 61620
atgactaaga tgaaacactc caacatatac aataacctcc ccacccccac ttcactctga 61680
ggttcttttg gttacttctt ttttacaaca aaggtttgat gtattcttta aaatataaaa 61740
agaaaaagcc actatattgc ctcagaacta ctccatctca ctgcctcact ttaaattctg 61800
attgactgta atgagaccac agacagattg atgaaataaa tttagatact aaatagcatt 61860
gtacttgggg tgatttagtt atatagctca tttatttctt caacaaattt gttgaacagc 61920
agtattagtt aaggcaatgc aagctgatgt cataaataaa cccccaaatc acaatggcct 61980
aaaaccttaa aggtgcaatt tttattcatg gaagcagggc aatgtaagta actacaggat 62040
tcaggcagcc agaattcttt catcctgtgt ttccatcatc ttctagagct tcaaagtctt 62100
ctgcatttat ccaagataag agaaaagaaa tttgagaagg tgtacctgtt tttagccatc 62160
ttctgctcac ttgctcttgg aattaacaaa tcacacaacc atacctaagt gcaaagtggt 62220
tgagaaatgt atttcctgat gggacaacca ccttccagtg ataacttcac actctgaaag 62280
ggctcagctg caacacctgt aataatactg tgctggacat tgtggaattt tctaaaagaa 62340
acaaaactgg gaccttgtct tccacgagca tgcacgttag caattttttg caaggaattt 62400
attcccttac caataataat agttaatata tatataaagt ttgttatgta tcaggctcat 62460
tattggacca gaggctgtgt gggtggtatt tcaggactca aaccctggta tctgcgctgt 62520
ccaaattcat gctcttgcct cctgccctcc gtagacttct aatcacttaa ggcaagagac 62580
tcaagtctag ttctcttaga agttcctacc acagggcaga gcccacacca gacagtgaag 62640
gaatgtggat tatactaaac aaaaccctag gatcgattgt ataggtaact gttggcaata 62700
aattagagct caattccaac tcataaaaag ctgtactcca aaatacaatt caccataaat 62760
gaaagctttt taaagatatg agttgcctat actataaata ttgatttttt ttcatctgag 62820
tcaattgaga tgcaaatgct aatttgagta gatgagcata tgagcattag tggaaaaact 62880
cacctaaaaa ggtacaagag agagcaagat gcccaagata gattaacaca acactttaac 62940
cagaatgttc tatcattttc tttctgcaca tacaccttct aacttgcaaa tctgtcttgc 63000
ttcacatgat ctcttcccat ctaatttgat gggtttttct ctgaaagaaa tgaagggagg 63060
attcctccag ttcttgtttt ctatggtaat tatcctaagt tttaggggaa gaattttaaa 63120
agaggaattc taggcacttc tcaacttgga gcagaataga gatgtattcc agtaaggaat 63180
atagttcatg ggcataagtg acttgaatat gaggtagaat tgggggtaaa taacctatga 63240
aatgtataga tgatattcta tcagacaaga gatattacct atagttttac aaaagaaata 63300
aacttttagc taggcctttt agaaacatgt aaaattaact gagtttacca agtatagaaa 63360
aaataggaaa ttgtcattta gaagggcatg atctattcag ctagccaacc aagaaatatg 63420
tcgagtgctc cgatgcaagg tagggtatga agagcagtgg ggagaagggg gcagtgaaaa 63480
gtgcaagaca acacctattc ccatataggt cagcagctaa gagcaacaac aaaagagaac 63540
aatgggcatg tacagtgcat ccacctgtac caggtaacat gccaggtgaa ttattcaaat 63600
tatctcattt ggcactaatg catgtaaagt acatagcaag gtgcctgtca cagaaagaat 63660
gtacaataag tgatagctgt gatttaaatg ccagttttat gtggaggtta ttatctgttt 63720
tcaaattaaa gaaagtgagg catggtgagg ctatgtgatg ctataaactg aatgttttta 63780
tccccacaaa atttatatgt tggaacctaa tcacgaaagt aatcgtgagg ccccgtaaag 63840
gtagggtctt tgggggttga ttaggtcatg aaaactccac cttcatgaat gctgttataa 63900
taataataat tagtactctt ataagagagg actgggggag ctcatttgtc ccttctacta 63960
tgtgaggatg tagcaacaaa gtaccatctt tgaagcagag agtgagctct catcagacat 64020
tgaatctgtc agtacattga tcttggactt cccagcctcc aggaccgtaa gcaataaatt 64080
tctgttgttt ataaattatc ccatctgagg tattttctta cagtaggcta agatagatga 64140
tatatccaaa gccccttagc tggagaaacc agacatacac gcatggaaag ttcaataaca 64200
atgagattct ttgtgaaagt tgctctcctg gcagttctat tttactagta gaatattcat 64260
cttagttcat gggttgttat tcctcaatct ggctgctgag tgccctttgg gtgtttatga 64320
ttttaactgc ctgcccctgg ttctagttgc ctgggttagg caatgagggt attttgtgca 64380
aattgtgtcc tgagtgccta tgctgcagct ccccatgagt cttccattct aattctaatc 64440
aacttccagc tatctatctt aagcagagta ccttttagga cagggtgata gggggagcct 64500
agaggtcaca tgcgaggcct cccatctgtg gaggtttagt tggagagagc tgacaatgag 64560
atgattcctg tgcagcagta gtcatgaacc tggggcagga atgagagcac catgctatct 64620
cccagggccc agagagtgtg gccctgagag acagatcagt gaaatctcta aagagcaatg 64680
cgccagattg atgagattat cttgggggta cttgctagtc tctttggctt cggaggggcg 64740
tggagggagg ggaacagata ggctcagatc ctctctgccc ttgaggctgg tcaaacatat 64800
tgcaggtttg tagagaagtg tcttctcctt gtacttgtac agatgatttg ctaaccggac 64860
ctgaaactgt cctatctccc tccattccag tgactcaagt gtcctcagag ctaatccagg 64920
taactagagc caggggcagg cagttaaaat cataaacacc caaagggaac ccagcagcca 64980
gattgaggaa taacaacaac ccatgcccca agatgaacat tctactagta aaatagaact 65040
tccagaagaa acagatgaca actttaacaa agaatcaaaa gtatacttac aaaaattgaa 65100
aaatagcatt aaaataactt tccagcattc aaaaacatgg cttttcaatt aaaaacatca 65160
attcagttaa ggaaaggatg gcctaggatg agacagactc acttgcaata taagacatta 65220
tagggtaaat accaaatgag tcgttttgat tctcaggtgg aattgaccat tagactcacc 65280
tgttaaaaac acagacttca acttacccag ccctaataaa tcaggaactc tggaagccga 65340
gccaatgaaa gtgcattttg gttggaagtg gccactggtt ttgggcactg gttatggggg 65400
ctgtggaaac tggcagtaca tgttgaatga gttatcaggg tgagagatca ctgggtctgc 65460
agggatggtg gtaaggaccg cgggagagag atgagaactg aaatgagcat gagaggtggg 65520
tagagtttat tgatagtgac ctaaaaatat gaaacaagga ttgatgaatg ttttgtctgc 65580
atgatcccaa gttttcacca gatttagaca caaatacttg tcacacccgt agatcaaata 65640
gcaagtctaa aattttttaa agtttagctt taaataatat aatacacatc agataaataa 65700
gaagttgaag tatcgcgatt ttgcaaatgt ttaggaagct tgtgttcatc cggttcacat 65760
ttccttcact ctatgtgtgc atgtttgaaa ctgaaatata caagttaaat ctttagtctt 65820
acttttttaa ccactgacaa tgatgatggt ccatttatct cccaatgttc catttcaaga 65880
cagatggatc agtttgtcaa ggtctgagta ggaataaatg gagtgctcca aaggcaaatc 65940
tgtagtggtg tgttcaattt gataaaatca cgcttgcata tttttttcct aaatttcctg 66000
aaccactcca ctaagggaga caacctagta cttcttggtt tttttctctt caaatggctg 66060
cttctgagtc acagatgtga aataacagga atcaaaggat tcatggatca agaaaaatga 66120
ctcctgatca tttccagtag gttatttcct tggagctgtt ctgctctgcc cccaggaggt 66180
gtaatggatt tgaactaaat gtaatcaaaa attttcacct gcatccttat ttcctgaaag 66240
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tagctcagtc ccatgaagct gaactttctg 
cttaggggaa ataaactcga tgtttctctg 
aatgctctat gccaaacaaa aagcactctg 
atcagctctc agagggtgat agagcacatc 
tcattttatt cataactctc tgtaacttag 
aacaaagtaa caactgggta gtttagacaa 
ctagaagttc aaagtcaaag tgttggcagc 
gatggctgat ttcatgttca cagggcattc 
atctgtttgc aaagacacca gtcatagtgg 
cttgattacc tctgtaaaga ccctatctcc 
ttagaactta catctgaatt tgaggaggga 
cctttttttc ccatgtcatc tattaacagc 
tggcttcaca gataccatgc tcttcctcaa 
gcagaatcac tgggagtgct ttttagaatt 
tgtaatccca gtactttggg agaccaaggc 
accagcctgg gcaacatggt gaaaccccat 
tgttgtggca tgcacctgta gtcccagcta 
agcccatgat cctccactgc actccagcct 
aaaaagaaga attatagatt cttgggtctc 
aggaggaact caggaatttg caatttacaa 
ggttggagtt ccacagccag ttagtgctcc 
taaatcttat aatagctacc taagaagtac 
ttgttttctc atctctgttt ttattacatc 
cttgatgtgc atagattctg ccttactcat 
cacacataaa taacattcaa taagcgttta 
gcaaactgta caattgtttc atcatttaaa 
cctgtatccc agcactttag gaggccaaga 
gaccagcctg gccaacatgg tgaaacctca 
ctgggcatgg tggtgtgcac ctgtggtccc 
acttgaacct gggaggcaga ggttgcagtg 
gggcaacaga gtgagactcc atctaaaaaa 
aaaaagaaga aaaatgttat tgacattgac 
tcttaagagt ttaacccgtg cagccatgcc 
tgatagctta cctgtggttt caggacacat 
cacatgctat tttggaactc ttctgcctac 
cgactatttc caaatgaata tttgttctgc 
acgttttcat aatgcttcat tttttttccc 
agttcaatgg ttgctctctc tgacaaactc 
ttgtgttcat ttttgaagcc acagaaactt 
gtcgttatgc gctttaccat gattatgtgt 
tacagcaggg actagattta atttattttt 
aggcagaaca agcacaataa attttcttaa 
ttgtttagtt tttgccttga tgaaagcctg 
ggtgatcata taaaaataga tttgttcttt 
tgtcaaataa aaaaataaac tagaataaca 
aacagctgac atacttctaa aaagcagaaa 
ttagttttcc aattgccaaa agatcacttc 
ggagaaactg caacacttat ggaatggcaa 
aaaacctgaa ggagacaagc ccagctatag 
ctaaagttac ataagtgatc aggaccgtat 
acgcagctgt ttaactctct caaacatttt 
gttgtctatt cccataatat ggaatattgg 
ggcaatcatg taagataagg ggaatgggca 
cttttatctc tttccagtat ccccactgtg 
agttttgcat tggaccaaat cactgcctta 
ttaagatctg caatgggttg ggatgatacc 
tctttggcct ctttggcaaa ggatatgaac 
gagtctggct caaagtggct aagactagaa 
ggaaaatagc ttggccttta gagagaccca 
cctggtccag caagaccacc atcacagcag 
ttttccactg agacattgtg aaaaccacaa 



cagctgtgag acccacggca tggctgcttc 66300 
agtcagaacc acagaacaac agtctcatga 66360 
cactcagatt tgggatgaga ttcacatgcc 66420 
caagcttctg gagcccctgc agcagagaaa 66480 
tcacttgtat tcatttttga gtgctgcctt 66540 
cagaaatgta ttgtctcaca gttccagagg 66600 
tctgtcccaa gcttccctcc ttggcttgta 66660 
tccccatata ggtgtctatc tccaaattcc 66720 
attagagtcc atctaacgac ttcattttaa 66780 
acataaggtc acattctgag gtactgaggt 66840 
gatgtaattc aactgattat accactgaat 66900 
ctgaagaaca cgtttagagg gatgttactg 66960 
tggttcccaa atgccatttg cagaatatat 67020 
ctaaatctgc caggcatggt ggctaacacc 67080 
aggcagatcg cttgagccca ggagttcgag 67140 
ccctactaaa aatacaaaaa attagctggg 67200 
cttgagaagg ttaggctgca atgagccatg 67260 
gggcaagaag agtgaaacct gtcaaaaaaa 67320 
acctccacaa attaaaatcc actggatcta 67380 
agttgcccag aacattctga tgcatcacca 67440 
cttattattt tttagacttg atttgggcca 67500 
acttcttctt taaacaatat gaacatagaa 67560 
ttctatcttc ctccactgca gtacaaacca 67620 
ctttttattc tcagtgtgta ccagagaatc 67680 
ttgaagaaat aaatacctcc agcaacacag 6774 0 
gaagttcctg gccgggcgca gtggctcatg 67800 
caggtggatc tcttaagctc aggagttcga 67860 
tctctattaa aaatacaaaa aaaaaggtag 67920 
agctactcag gtgattgagg caggagaatt 67 980 
agctgagatc acgtcactgc actccagcct 68040 
aaaaaaaaga aaaagaaaaa aagaaaaaga 68100 
atatctttat acttgagaga attcagctac 68160 
cgctgtcctg actcattttt actcctgcca 68220 
ctttcttttt ccagcctcaa ggatcttttg 68280 
cctctttctc caccccatcc ccagttgcca 68340 
ccttcaaatt agaagtggtg cccctgttaa 68400 
atggagacac tgatcatgct tgtaatgaat 684 60 
taatctacat gtaggcaggc ttcatgtctg 68520 
gaacagagta ggaactcaaa tattaattga 68580 
tttttcccac cattaaactg ggagcttctc 68640 
ctatttttgg cacctggcac agtgcctgga 68700 
aatttgtaag gtttttgttt atttgaatgg 68760 
ggatagaaat ttagaacaag gttggatgtt 68820 
tttgtcatta gagcccaaac ttaaaaaata 68880 
gggaacattt aatggaacca aggcacctca 68940 
gtaatagtaa gggagataat agcatacgag 69000 
aaatacagct cttggcagtc atcttcttat 69060 
ttccaccaat cccctttttc taacagcaga 69120 
tccagctggg acctgtggca ataacctgac 69180 
gtggtccgta cccctctgcc tgaggatgct 69240 
gttcttttac tgacaaccca ctgcagtcaa 69300 
gtctatagta atttcctatt cctatttcta 69360 
cctatttctc tggggaaggt ataaatgtgt 69420 
tagagctggt ccacaatgga gaggaggaaa 69480
gaaggaaaga gattgcaatc ccattcattt 69540 
catttaaaaa agaaaaaaaa agaggttacc 69600 
tagtaataaa gcctcagcat caaggcttgg 69660 
gcatcttcta acataaatat ggcagcatta 69720 
gattatcttc accttaaaaa ctaccaagac 69780 
taaatacata gccaccaact tcacagaagg 69840 
tccattaccg gtggatgagt cccagctctc 69900
ttactacctg gagcttccaa gcaagactta cctgtccttt gttccagatt ggttcctggg 69960
atcctggacc aacacaaaga gaagaggaaa gcctcaaggg aaataaaagt gaaggtttca 70020
gtgatagttt aaatatatat ttttaaatgt tagctttttt ttaaatactt cacagtgatg 70080
gatggagtat ttcaaccaca atggcaaaga atgaattacc ggcagtgtca gagtttcttg 70140
ttgctgtggg aagacagctg ccatgttgag aggtgccttg tggagaggct catgtggcga 70200
ggaactgaga ctggcttcta tccaacaacc agtgaggaac taaggcccta aagtcctata 70260
gtccacaagg aactaaatcc taccaacaac tacatgagtg agcctggaat ctgattttcc 70320
ccagtgggga cttaaggtac ctacaacctg gctgttatgg gctaaactgc atcctgccca 70380
aatttatatg tttgagtctt aactcccagc acttcagaac atggttttgt ttggagacat 70440
agtcattaaa gaggtaatga agttaaaatg aggtcattag gatgagtcct aatccaatag 70500
gactggtgtt cttatatgaa gaggaaattt gaacacagac acatatacag gaaagaccat 70560
gcgaagaaga cacagagagg aaacggccat ctacaatcca aggagagagg cctcagaaca 70620
agccaaccct gcagatacct tgatctaggc atccagaatt gtatgaaaat acatttttgt 70680
tatttaagct acccagtctg tggtgctttg ttatggcaat cccagaaaac taattaacta 70740
attgactgca gccttgcaag ggaccctgaa ccagaggacc cagttaagcc atgtgcagat 70800
tcctgactca cagaactagg agataacaaa tgtttgttgt tctaaggtgc taaatgttgg 70860
gaaaatatgt tatgtggcta tgttgttttg cacatgtgcc acataatgat gttttggtca 70920
acaacagacc acacatatga tggtgggccc ataagattgt aatactgtgt ttctattatg 70980
cattctctat ttatagatac acaaatactt gccactgtgt tacaattgcc tactgtcttc 71040
agtactgtaa catgctgtac aggtttgtag cccagaagca atagactata ccatacagtc 71100
tgggtgtgta gtaggctata ccatctatgt ttgtgaagta cactctatga tgttcacata 71160
attatgaagt tgcctagtga tgcatttctc agaatgcatg cccgacatta agtgacactt 71220
ggctttatgt ctatcatgca tttaagatat tgcaatatag ttaacttatg ttttaataca 71280
cctcttaatt ctgttttgat cccacaccaa ccaagatgca aaactaattt tttcgcccat 71340
ttccataaaa aacacaatta caatgggcct aatgtggttt atgtcatttt ccctgaagat 71400
ttgtggtggt ataggaagca gtcacagcac cctgtataaa gctcttccat ggcctgcccc 71460
cttttcgcag tcacctctat tatttgaagg aatggtccaa ccaacctcaa agctctggaa 71520
atttgcccca aaataatttt accctgaagc cattgaagct taagtttcag accccagttt 71580
cacattcctg tgagttctgc aagtcactgg gagctataga gtcttctacg tagatttggg 71640
aaacaaggtg aaaatacctc aagaaatctc agaagccagg ggcctgcatc tccaagcctc 71700
tggcaatttt cttgtggttt ctatcctcac tctagatatt tgcttttgag actgattttg 71760
aattcagaat tttatattct ctcaaaaaga gaattctcca aattctataa gcttcaggct 71820
ctacaaaact tgtctattct gcctaactag atttatcata gcacgtacca aagatggccc 71880
tgcctcatga ggacttttca tatcagaccc ttagataata atattccctc atgtaagtaa 71940
taatatttcc gcatgtagtt atgtatgaat aattttcatt catctatatc atgttggcca 72000
tagtcagtgt ccatgtccaa tttatttttt gtgtccttgg tgctcagaaa gttaccagat 72060
gttcagtcag aattttattt aaaatatttt gatgaactag atttaaagat agattaaatt 72120
aattcctaca catgatttaa tttaatggat aaaataaggg actgagcatt agcctgaact 72180
gtgtacttgc attagccctg atgttcccca gagggaataa tatgtgaaga atactgcata 72240
ggctgaaaaa caagagttcc cagcagacca atattctttc tttctgcaag ctgtgagaca 72300
cctggggaaa gttctccaca tttggttttt cagctactcc acccatgact agaaggaaga 72360
ttttatttta ttattttatt ttattgtatt ttttaacagt aaaagaaaaa ttattgttat 72420
tttaatgaaa aagaaccacc tggaaagtct agtgacattt ttgaaggcag ggaccacttt 72480
ttccagggat ttgtttccct atgctagtgc tccattcacc cttgcaccac ttcagcaaag 72540
cacatagtag gtgtagcagg gagatgttaa gatggaaact tggccctgac cataggccaa 72600
gccagggtgt ctagtcggga tttatccata aacatcaata tttatcacta gatgatatat 72660
tgatttctta tctgcctcct atacctattg atctattttt cacagcttat agtggggagg 72720
gtcatgttca gttattatcc ttaaaccaaa acaatctctt cttgaaaaat aggatgctat 72780
gctctgatgt ttgcagaggc caggaaattc ttgctctgag agttccagaa aaaaagattt 72840
gtccagattt gaactaattt cagaaaaaca aaacaaaaac aaaaaaccat ccaaggttaa 72900
gaggtgactt ctgactcaca gaatgaacaa gaaggcaggg gaaaaaataa gtcccccaag 72960
attcaggtca aaggaaatga acagaaggat gatggagtgg cccagaggag gatcacgtga 73020
cagaaaagtc aagataaatt agtggcagga cagaggaaaa gaactaaaga cctgcaggat 73080
cctgaaagaa agaatgaaag gaagcagatg aagattctag agcccagctg aatagccagg 73140
ccgcctgagg gtaaagaggg agagaaagag gaagcagtga atcaaatcaa aaaaggatag 73200
aaaaatacag agaaacaagc cagggcacgg tggctctcac ctgtaatccc agcactttgg 73260
gatgacaagg aaggcggatc acatgaggcc aggaatttga gaccacgctg gccaacatgg 73320
caaaaccctg tctctactag aaaaatatgg aggaatgttt gagagattgt gcactgccag 73380
gggtgaggac ataaatttat ttgctatgtg aatgtttttt gagactcatc tagttccttt 73440
acaattttaa tttagaattt ctttagctcc aaatcaggat cattagctgg gagttcttat 73500
aaatcaaaag caaaacaaag ctattgcaaa caagtgggag ttctcataac cttaatcaga 73560
aaggcttgag ttgcatttcc aaatcctgtc tctagagacc atgagttttt aatgatagta 73620
tttccatgaa agcccaattg aatacaatca ccacccagac caaagtgtag gggaaatcat 73680
aaagctactt tcaaatccga gtcttctttg aaatttaaaa ttatattcca tgataatggt 73740
tctcataaac cttaatgcaa gggaaaaaaa aaaaactaca gcagaataaa gaagatttaa 73800
aaagaagcag gataggaaaa aagacaagcc agaggctagc tatatctttt cataaagcac 73860
acctggtctc ataagcacat gagagctagg cccagaagtt tcgagccttg gagactcact 73920
tcaaacaaat tccagggaaa gacactcaat taagtgttca ggacaattgt atatccccag 73980
tgcactcatg tactggaagt gcttaaactc tgccctcagt ggtcaataaa atggtattaa 74040
gtcacagaca acccattgcc attggtattg gctgatgtct ctggtagttt atgtgatgac 74100
aaacattagc cttaacagca gagccttcct ttctttgatt tccgtgtgaa attctgtggc 74160
tgggagatgt ttatgttcac atgaacataa ctgtaagcaa accatcttat ggtagacatc 74220
tctccagaaa aagcaaattc catcttttta aatagaaaaa aaaaacagca gcaaaatgaa 74280
attaatgcct ttggctcatt gtgttgtaaa attatccttg aagttatttc acactgccct 74340
aaaaaatcat tgaattcaga ttataacaca gctcttatga agctgttagg cagttgcatc 74400
tttaatgagg ctgagtaggt aagaggaaca gatactctag attgaaagcc atcgcaaaaa 74460
atgtgaaaca actgtattaa atttaattta atttaaccaa tgggtgtatt tattccctgc 74520
cttcttccag aaaggattta gggagcttag gtaatgtgga ggtctgaagg tgtagatact 74580
catgaaagcc tcagattaat atctagaaca tataatgtgt ccagatagtt ttcaattatc 74640
aatgcattaa tctaagcttg gaaaggactt ttaaactgca atgaaacctc aactagtttc 74700
aaacctgcat tttttagtaa gctattttgt catttggaga cccacacaaa aaacctattg 74760
ttcctttaaa aaggagagag agaaaaaaag aaaaacaacc tgctggtgtt gctttgttca 74820
tcttatcacc ttatcattca gcagcctatt caggatcaaa tgtgggaaag tctgtgtgtg 74880
agattaagcc ctatactcag cacaggaatt tcttggataa atgtctttct taaatggata 74940
gctatgtaaa aaacagaaga gaaactctat tattacaatg cagttgggtt tttttgtttg 75000
gtatgtttca aatctaatat ttaaaattaa gaaaattcaa agcaaaccat ctgtgtggat 75060
gttttcatac tccatttttg tgacaatagc ttgcttaacc gagattatta gtggattctt 75120
tcaataagag gaagagagta ccatcattaa gatggtatgc ctccactagc ctttattttt 75180
gagattaaag aatgacaaaa atatttatat atattttttc attattaaga cggagtctca 75240
ctctatcgcc aaggctggag tgcagtggcg ccatcttggc tcactgcaac ctctgcctcc 75300
caggttcaag caattctccc acttcggcct cctgagtagc tgggatcaca ggcacccgcc 75360
accacacctg gctaattttt gtatttttag tagagatggg gtttcaccac attgcccagg 75420
ctggtctcaa attcctgacc tcaagtgatc cgcctgcctc agcatcccac agtgctggga 75480
ttacaggcat gagccaccac aactagccaa aaaatatata ttttctttaa aaaaaagtta 75540
ttttctgcct aattacttca atatttcatg aacattgtgg gggaaagaat attcatacag 75600
ttctctaaat atatcagtga gttgaggtac caaatacaga attctattga attctaattg 75660
caacactgaa gcagacttac tctaagatcc tgctacctgg actaaccaca gactataccc 75720
agattccatc tgcaagatgt agtggagcca aacatcacat cactgaggaa tatttattat 75780
atatctttct gtagaggtga gagagtattt atagatataa caaacaattg gtcagatact 75840
actggagatt tggaagcttc actaggtttt tcactaaatt tgttgatagc caactctaaa 75900
gctcattcta ctactactag ccatcttccc ttggacaatt tcatgggcat cgcttcactg 75960
aaaagaaaaa tgagtaagta taatctcact agtacatgaa taactattat aaacacaata 76020
ctttctacaa ggtttattac taatctatgt catttaagtt ctagatttta cagtattttg 76080
gagtgatgaa taaatagcat gctgttcttc tatagctagc aaacaagact cagctttgga 76140
gacaagaaga aggaatacag gagatgtttc cttgaatgag tcttcttcct aaactctgct 76200
agaagtatgt cagaaagcga aggttacctc agtctccaag tcatgaaaac atgaggtctt 76260
tctgttttct agggaaagta tttcatcatt tcctagtgtg gctttttgca caccccagaa 76320
ggaggaagac aagccatcgt gtcaccttta catggtagtt gagatgcttc acatttgtac 76380
ataacatctt atatgatcga aagaggaact caaactcgac aggggagaat aaggaaatac 76440
atctaaaact gacaattatg ttaatccctc aattccctgg gcccccaaaa tagtctcctc 76500
tatttgccat ttcaaaatag ctaatgagag ggtaattgga catgtagaca ctgaatttcc 76560
actgctcatg gtttccttcc ctgggctccc agtcacccgg aaggccccgg attacctcaa 76620
atgtcataac gggttcatcg cattacgaat tacctccttc tgccaataaa ctacaagttc 76680
tttgaggaca gaaactgtga tttatgggca tttgtatccc cagagattca aagagtgcct 76740
gatacagagt ggtgatagat gattttatca tcttatcatc atcatcctca tcatcgttgt 76800
acctaaatag aggttacttc ctgtttcttt tatttaacaa taacctatta taatatatac 76860
tatgcatttt taatgcactt tacaaatatt gtcctcattt aatccctgta aggtgaatac 76920
tattattggc tccattttgc agataagaca ctgagtcgtg aaaaggctaa gtacctcaca 76980
caagatcaca cagccagtga cagaactagg atttggctgg actcttaaag actctggttt 77040
gccatttaaa ggtacgctga gctccacaac actgccttag aacttgctca gagcaattgt 77100
catattacta tggtaggttt ccctgtctcc cttaagactt aaagcttctg gagatcagaa 77160
actgtattgt ttacttgcaa aatcatgggc ttaggagcca gaaagatctc gatctaagtc 77220
ctgcctcatc tgcttcttct tcatgggatt agtagactga ttttcccctt gcttagccat 77280
gcagtttgaa tggaattcac cccattctca gctacaacca cggatcttgt ttgctttaaa 77340
ccaatcatca ctgcccaatt ttctggcctt tgattgattt aaaggtgaac acatgaccgc 77400 
tttggaccaa tgaggctcag gtggtatttt gtgagggtct ctggaataga aaatcatttc 77460
tcttctgaca gaactacaag gcaaggcgca atcactgatg gtgaaagata aagatcatta 77520 
tattaggaac tgttggctgt catactgcta atgtgaggtg agttagcctt agaaagagga 77580 
tgctaccttg aaaggcaggg cagatctatg gaaataaact gagtttgtgg tgacattgat 77640
agccacttta gttcatcagc tgaatcccaa tctacttggt tttttcagct ctgtaagcca 77700 
acaaataccc tgctagtgtt ttagcctatg ccagttgttt tttctgctgc ttgttaataa 77760 
gacacataca gtcttggaca aattatataa aaatatgaat ctttggccag gcatggaggc 77820
tcacgcctgt aatcccaaca ctttgggagg ccgaagcggt cggatcacag ggtcaggagt 77880 
tcgagaccag cctggccaat atggtgaaac cccgtctcta ctaaaaatac aaaaattagc 77940 
tggatgtggt ggcacatgcc tgttgtccca gctactcagg aggctgaggc agaagaatcg 78000 
cttgaacccg ggaggcagag gttgcagtga gctgagatca tgccactgaa ctccagcctg 78060 
ggagacagaa tgacactcgg tctcaaaaaa aaaaaaaaat tctgaatctt tgaatgttat 78120 
cgtatagctt tttaaaagac tagattaagc tatggatgta agcaccaatg cattttttta 78180 
aaagtatatt ggatgaatga gttaaagtct aagtgaattc agcaagtttg taacccttga 78240 
gctcttgtga gctcaatctg ccctcaatca tctgagaatg aacaaaacag actcaaccat 78300 
tttgtgtcat ccagatgaat agaaagggcc tgaatatatt ctttttttat attttttgtc 78360 
ataggcccta gcgttttcat ctgctcagtt tttgagtgtg cttgttaggg ctttaatgct 78420 
aaactcaatg taagctgggt ttatgtaatt tagtggtttc tagacttagg cttcctaaat 78480 
gttccattag acaagaagac aaaatatgaa acaaaataac atcaaactac cattaacaat 78540 
tagcattgtc agagtcagta aaaactgttt tgaaataaag cagcttttca ggccagtatt 78600 
tgagaactaa tgatagagtt tttttgtagg caatgaatga agtgttgtgg aacaaatgtg 78660
gatttggact ttcaagacca acttgaacca gagtcttgaa tctgcaactt tactagctgt 78720 
ggcttctttt taattgatta actttctttt tctgtaaaat gggaataatg attcctacct 78780 
cttacaagcg ttgtggtagg aggtattcca aatgagataa atatgtgaaa tcactaaaac 78840 
ttccagaata catggaaata atatatgtct gtattggcaa attatgagtg tagtaatgag 78900 
ggatttacat ggacacattt acattcttta gttcagactc cctttgacga cacataacaa 78960
acacacacaa gaatatttgc cctgtgtcca gcactctttt gagaatttct ctcttctcct 79020 
ccttctgata tcccaaccaa ccaaccagta ggaatgtgcc ctagaaacta tgttttacag 79080 
ctcagtccac atcttttcac cttacaaatt acttattgtt agtttcttgg gaaagtaaaa 79140 
ttgagatatt aagattctat gaaattggtt ggtggtattg aagactgtgc tgctgctgct 79200 
gctactgctc cagaactcta gtggagatat tacaaatggg agcagaaaaa gtgagaaggt 79260 
ggtctgtaga gacagagcac agcacagagg tagaaaggtt gagataaaga aacaatgcag 79320 
tgagagaagg agagaacaca gacatagaaa cagagacata aaggcaccag acacgagagc 79380 
agagataatt taataaggcc gtagtttctc ttggatttcc agttccttct tttagtcaat 79440 
tctgatattt agcacactgc actgcaccag atatcttgtc gccttcaaat aaagtatttt 79500 
tcccccttaa acatgctaat ttgggcttgt attatttgca atcaatagaa ttgagaaaaa 79560 
cactctatgt catatctgaa ctagtatatc agaatggttt tctgccattg gtaacagaga 79620
cccagctcaa tggcttaaac acacaagggt ttatcctgtc acattaacaa aaaaaaaagt 79680
gcagagtagc agtccggggt tggaataggg tccacaaagt catctggggc caggcttcat 79740 
ctatctttct cttacatcac aagtgacttc tgtctacaag gcccctcatg gtccacaatt 79800 
gattgttttt acaacagctt tccaaagagc agaaagaaaa acaagtagaa gagcttgtac 79860 
tttcttttct atttatgtca cctctgctta catctcacta gccagatata gacatatggc 79920
cccatctacc tgcacagaaa tctcagaaat gtaatttttc agcttggcca ttgcccaggg 79980 
ttctgttact agggaagaag 80000



<210> 6 

<211> 2260 

<212> RNA 

<213> Homo sapiens 

<220> 

<221> mRNA 
<222> (0)...(0)

<223> Corresponds to a cDNA detected by cDNA cloning. It 
includes two exons. It is transcribed from the 
+strand of Seq_ID_06.doc.
<400> 6

atttgaattg gtgaacttag taaagcagac ggctctcacc aataagggca ggcatcatcc 60 
aatctgtcga aagcttgaat aaaacaaaaa gaggaaggga aaatttgctt cttttcttct 120 
tgatctagta tatcatcttc tcctgccctt ggatgtgagt gggccttcag acttaaacca 180 
ggagttacac ctttggcttc cctggttctc agttctttgg acttggactg aattacactg 240 
ccaggtttcc tggttctcca gcttgcagat ggcagatcat gggacttctt ggcctccata 300 
attgttttca tatctccagg cctttcattg ggtcaggttg gcatttcgct gccctttatg 360 
tgtgtgacaa gtgaaaataa ggaaagaaaa aaactcaagt gaagaaaatc agaatctgcg 420 
cagcagttcc tgggcgtttc agctgcttcc cacatcacct gcctcatcaa gccccagcat 480 
ccatctcctt gctcatctta caccctgtgt gcatgacagg cccaccattc atttatcaga 540 
gcaaaggctc tcccactatt ctggttcacc cccctactta gccagatata caagaatatc 600 
tgcacggatg acctgcctca cctgggagct cagaggagct cagattccat tactatcgca 660 
ccaaggacag atctcccagc aagaatgaca gaaaagacta actgccccca aaatctccct 720 
tccaaaacac agttctctta attctcccaa gaaaccagaa tgtgactgct cacctctcta 780 
aggacctgaa aacaactggc catttcagct atttaaatca actttaaaaa atccaaccgc 840 
caaaatatta aaccattttg gttggaatga taacataact aacctgctga cagctgcttc 900 
tgctaggtgc aaaaatggaa aaaaaaatac ttctaatcag gtcaaatcac tctacctttg 960 
ggattctaaa tttactcata ttctcaaaga aatatattca gtcatagtgg ggaaaatagg 1020 
attattcctt tagctcgata agcaaccaga agttcttcct tcaaatcttg acatttaatc 1080 
aatcagaaat tgatttttgg aaaactgttt cctatgaagc tatctctgcc tgaaggattt 1140 
ttcttttaca atccagacta tagaaggaaa ttcacaacct ggactttcac ctccattggt 1200 
cagagtttta ctgaccaatt cccacctctg ccttacacct aacggaagtt tatgcctgtt 1260 
ttctcttcac ataccccaac agttacaaat ggttgttatt attaagcatc ttttattttg 1320 
tggcctctga ttacatggtc ccctaaattt tgacctaatc acaaaagatt ggtaaaattt 1380 
cttaacatat taataatatt ttgtttatgt gtcaatatct tagcatgtat caattaagac 1440 
agaggtctta acgttctctt tttgaaagag aatattagga ttcagagata ttaagagatt 1500 
ctcccaggat cacagttagg taacagagct ggattttagt ccaggtctgt ctacagctct 1560 
aacgtatata caccctttgt ataacatgtc acgaattcag cataaaggga tcttcagtga 1620 
tctaagtcag gggtcagcaa ccttttctaa aaaggaccaa atagtaatat ttcaggcttt 1680 
gtggacccta tggtctctat cataactgtt caaatcacca tgtagtgtaa aaggagccat 1740 
aagcaaaata taaactaacg aatgtggctg ttttatggga ttttttttta actctttatt 1800 
tacaaaagca ggtggcagat cagaactcac ttatgggcca tagttctctg acccctgacc 1860 
tgagaaaatc ttatatttat ggacaacatt tagactgtga cttgccaagt aagaacaaga 1920 
agctctgtca actgaaggtc aaggctggag ttctgaaagc aaagagctgt ctggtgttaa 1980 
tgataagtga aatagttaaa gttagaagat cccagttata agaagcacaa agaataatga 2040 
ccatagactc ctgaacaaga atgtctggac ttctggctta ggcactcttg ttgtatggtc 2100 
caggccaagt tacctaatct ctccaggcct ccattttctt atcattaaat gaagataata 2160 
aaagtatttt cctcagagag ctgtaagaat aaactgagct aacccatgtc aagcacatag 2220 
aatagggccc agcctatatt aatttatcaa taaatgccag 2260 

<210> 7 

<211> 2817 

<212> RNA 

<213> Homo sapiens 

<220> 

<221> mRNA 
<222> (0)...(0)

<223> Corresponds to a cDNA detected by cDNA cloning. It 
includes three exons, the first and third are 
identical to the two exons in Seq_ID_06.doc. It is 
transcribed from the +strand of Seq_ID_06.doc 

<400> 7 

atttgaattg gtgaacttag taaagcagac ggctctcacc aataagggca ggcatcatcc 60 

aatctgtcga aagcttgaat aaaacaaaaa gaggaaggga aaatttgctt cttttcttct 120 

tgatctagta tatcatcttc tcctgccctt ggatgtgagt gggccttcag acttaaacca 180 

ggagttacac ctttggcttc cctggttctc agttctttgg acttggactg aattacactg 240 

ccaggtttcc tggttctcca gcttgcagat ggcagatcat gggacttctt ggcctccata 300 

attgtgggca aaaaggaaag agacaaaaca gcatgaaatg atgagaccaa gtgatgaaaa 360
ttcattcaca atgattgctt tcaagagtaa tttctcttgg gtaattcagc agcctgttac 420
tatggctctc tggagtgata gctaatgtaa atgaagcctc taaaagtgga ttatcctgac 480
aagaatatac tcagccaata atgcaacaga aatccattca aagcattcgg gaaaaattca 540
aaagaataaa tattcttttt ttttttttaa agttaatgac ctacgatcca tttcttccct 600 
gactaacaag cagcaagcac ttaaaaatat ccagccagga tgaaatagaa acccacctga 660 
cttgttaata tttttgtttg gtcccaggga ctcagattct aagccaaatt ctttgaatga 720 
tcttggcaaa tgtctcgaat tatttttgcc aacttttctt tatcttggaa aaaaagtttc 780 
atgaatgggt gtcaaaattg attagtttta aaaacctttc ttgcagatac gtatggcacc 840 
ctaaaactgt attagaaaaa aatttcatat ctccaggcct ttcattgggt caggttggca 900 
tttcgctgcc ctttatgtgt gtgacaagtg aaaataagga aagaaaaaaa ctcaagtgaa 960 
gaaaatcaga atctgcgcag cagttcctgg gcgtttcagc tgcttcccac atcacctgcc 1020 
tcatcaagcc ccagcatcca tctccttgct catcttacac cctgtgtgca tgacaggccc 1080 
accattcatt tatcagagca aaggctctcc cactattctg gttcaccccc ctacttagcc 1140 
agatatacaa gaatatctgc acggatgacc tgcctcacct gggagctcag aggagctcag 1200 
attccattac tatcgcacca aggacagatc tcccagcaag aatgacagaa aagactaact 1260 
gcccccaaaa tctcccttcc aaaacacagt tctcttaatt ctcccaagaa accagaatgt 1320 
gactgctcac ctctctaagg acctgaaaac aactggccat ttcagctatt taaatcaact 1380 
ttaaaaaatc caaccgccaa aatattaaac cattttggtt ggaatgataa cataactaac 1440 
ctgctgacag ctgcttctgc taggtgcaaa aatggaaaaa aaaatacttc taatcaggtc 1500 
aaatcactct acctttggga ttctaaattt actcatattc tcaaagaaat atattcagtc 1560 
atagtgggga aaataggatt attcctttag ctcgataagc aaccagaagt tcttccttca 1620 
aatcttgaca tttaatcaat cagaaattga tttttggaaa actgtttcct atgaagctat 1680 
ctctgcctga aggatttttc ttttacaatc cagactatag aaggaaattc acaacctgga 1740 
ctttcacctc cattggtcag agttttactg accaattccc acctctgcct tacacctaac 1800 
ggaagtttat gcctgttttc tcttcacata ccccaacagt tacaaatggt tgttattatt 1860 
aagcatcttt tattttgtgg cctctgatta catggtcccc taaattttga cctaatcaca 1920 
aaagattggt aaaatttctt aacatattaa taatattttg tttatgtgtc aatatcttag 1980 
catgtatcaa ttaagacaga ggtcttaacg ttctcttttt gaaagagaat attaggattc 2040 
agagatatta agagattctc ccaggatcac agttaggtaa cagagctgga ttttagtcca 2100 
ggtctgtcta cagctctaac gtatatacac cctttgtata acatgtcacg aattcagcat 2160 
aaagggatct tcagtgatct aagtcagggg tcagcaacct tttctaaaaa ggaccaaata 2220 
gtaatatttc aggctttgtg gaccctatgg tctctatcat aactgttcaa atcaccatgt 2280 
agtgtaaaag gagccataag caaaatataa actaacgaat gtggctgttt tatgggattt 2340 
ttttttaact ctttatttac aaaagcaggt ggcagatcag aactcactta tgggccatag 2400 
ttctctgacc cctgacctga gaaaatctta tatttatgga caacatttag actgtgactt 2460 
gccaagtaag aacaagaagc tctgtcaact gaaggtcaag gctggagttc tgaaagcaaa 2520 
gagctgtctg gtgttaatga taagtgaaat agttaaagtt agaagatccc agttataaga 2580 
agcacaaaga ataatgacca tagactcctg aacaagaatg tctggacttc tggcttaggc 2640 
actcttgttg tatggtccag gccaagttac ctaatctctc caggcctcca ttttcttatc 2700 
attaaatgaa gataataaaa gtattttcct cagagagctg taagaataaa ctgagctaac 2760 
ccatgtcaag cacatagaat agggcccagc ctatattaat ttatcaataa atgccag 2817 

<210> 8 

<211> 1970 

<212> RNA 

<213> Homo sapiens 

<220> 

<221> mRNA 
<222> (0)...(0)

<223> Corresponds to a cDNA detected by cDNA cloning. It 
includes two exons, the last is also present as 
"exon-2" in a human cDNA clone (sequence acc. no. 
BC036936). It is transcribed from the +strand of
Seq_ID_06.doc 

<400> 8 

atttgaattg gtgaacttag taaagcagac ggctctcacc aataagggca ggcatcatcc 60 
aatctgtcga aagcttgaat aaaacaaaaa gaggaaggga aaatttgctt cttttcttct 120 
tgatctagta tatcatcttc tcctgccctt ggatgtgagt gggccttcag acttaaacca 180 
ggagttacac ctttggcttc cctggttctc agttctttgg acttggactg aattacactg 240
ccaggtttcc tggttctcca gcttgcagat ggcagatcat gggacttctt ggcctccata 300

attgtgtatc cccactgtgt agagctggtc cacaatggag aggaggaaaa gttttgcatt 360 

ggaccaaatc actgccttag aaggaaagag attgcaatcc cattcatttt taagatctgc 420 

aatgggttgg gatgataccc atttaaaaaa gaaaaaaaaa gaggttacct ctttggcctc 480 

tttggcaaag gatatgaact agtaataaag cctcagcatc aaggcttggg agtctggctc 540 

aaagtggcta agactagaag catcttctaa cataaatatg gcagcattag gaaaatagct 600 

tggcctttag agagacccag attatcttca ccttaaaaac taccaagacc ctggtccagc 660 

aagaccacca tcacagcagt aaatacatag ccaccaactt cacagaaggt tttccactga 720 

gacattgtga aaaccacaat ccattaccgg tggatgagtc ccagctctct tactacctgg 780 

agcttccaag caagacttac ctgtcctttg ttccagattg gttcctggga tcctggacca 840 

acacaaagag aagaggaaag cctcaaggga aataaaagtg aaggtttcag tgatagttta 900 

aatatatatt tttaaatgtt agcttttttt taaatacttc acagtgatgg atggagtatt 960 

tcaaccacaa tggcaaagaa tgaattaccg gcagtgtcag agtttcttgt tgctgtggga 1020 

agacagctgc catgttgaga ggtgccttgt ggagaggctc atgtggcgag gaactgagac 1080 

tggcttctat ccaacaacca gtgaggaact aaggccctaa agtcctatag tccacaagga 1140 

actaaatcct accaacaact acatgagtga gcctggaatc tgattttccc cagtggggac 1200 

ttaaggtacc tacaacctgg ctgttatggg ctaaactgca tcctgcccaa atttatatgt 1260 

ttgagtctta actcccagca cttcagaaca tggttttgtt tggagacata gtcattaaag 1320 

aggtaatgaa gttaaaatga ggtcattagg atgagtccta atccaatagg actggtgttc 1380 

ttatatgaag aggaaatttg aacacagaca catatacagg aaagaccatg cgaagaagac 1440

acagagagga aacggccatc tacaatccaa ggagagaggc ctcagaacaa gccaaccctg 1500 

cagatacctt gatctaggca tccagaattg tatgaaaata catttttgtt atttaagcta 1560 

cccagtctgt ggtgctttgt tatggcaatc ccagaaaact aattaactaa ttgactgcag 1620 

ccttgcaagg gaccctgaac cagaggaccc agttaagcca tgtgcagatt cctgactcac 1680 

agaactagga gataacaaat gtttgttgtt ctaaggtgct aaatgttggg aaaatatgtt 1740 

atgtggctat gttgttttgc acatgtgcca cataatgatg ttttggtcaa caacagacca 1800 

cacatatgat ggtgggccca taagattgta atactgtgtt tctattatgc attctctatt 1860 

tatagataca caaatacttg ccactgtgtt acaattgcct actgtcttca gtactgtaac 1920 

atgctgtaca ggtttgtagc ccagaagcaa tagactatac catacagtca 1970 

<210> 9 
<211> 1989 
<212> RNA 

<213> Homo sapiens 
<220> 

<221> mRNA 
<222> (0)...(0)

<223> Corresponds to a human cDNA sequence (sequence
acc. no. BC036936). It is transcribed from the
+strand of Seq_ID_06.doc

<400> 9 

tttgcaggcc cggcacagtc tcagttcccc acagcacaga gaggaagcaa ggtctttgca 60 

tgccctgttc acttggccaa acattgccct agatgtgcaa ttcaacccat ataaagcaag 120 

cgtaattcaa gagccctgac tctttcacca tttcatgatg gcttattact cgcacacggc 180 

attgtcaggt tccagcaact ctgtgtgcag caggcagaaa tgcagtgttt atctctgaca 240 

ggcaaacatt cttaggctca tcggcatgag tatccacgct gaactggcta tcggggaagt 300 

agatccctgc atgggaaagt caaggtatcc ccactgtgta gagctggtcc acaatggaga 360 

ggaggaaaag ttttgcattg gaccaaatca ctgccttaga aggaaagaga ttgcaatccc 420 

attcattttt aagatctgca atgggttggg atgataccca tttaaaaaag aaaaaaaaag 480 

aggttacctc tttggcctct ttggcaaagg atatgaacta gtaataaagc ctcagcatca 540 

aggcttggga gtctggctca aagtggctaa gactagaagc atcttctaac ataaatatgg 600 

cagcattagg aaaatagctt ggcctttaga gagacccaga ttatcttcac cttaaaaact 660 

accaagaccc tggtccagca agaccaccat cacagcagta aatacatagc caccaacttc 720 

acagaaggtt ttccactgag acattgtgaa aaccacaatc cattaccggt ggatgagtcc 780 

cagctctctt actacctgga gcttccaagc aagacttacc tgtcctttgt tccagattgg 840

ttcctgggat cctggaccaa cacaaagaga agaggaaagc ctcaagggaa ataaaagtga 900 

aggtttcagt gatagtttaa atatatattt ttaaatgtta gctttttttt aaatacttca 960 

cagtgatgga tggagtattt caaccacaat ggcaaagaat gaattaccgg cagtgtcaga 1020 

gtttcttgtt gctgtgggaa gacagctgcc atgttgagag gtgccttgtg gagaggctca 1080
tgtggcgagg aactgagact ggcttctatc caacaaccag tgaggaacta aggccctaaa 1140
gtcctatagt ccacaaggaa ctaaatccta ccaacaacta catgagtgag cctggaatct 1200
gattttcccc agtggggact taaggtacct acaacctggc tgttatgggc taaactgcat 1260
cctgcccaaa tttatatgtt tgagtcttaa ctcccagcac ttcagaacat ggttttgttt 1320
ggagacatag tcattaaaga ggtaatgaag ttaaaatgag gtcattagga tgagtcctaa 1380
tccaatagga ctggtgttct tatatgaaga ggaaatttga acacagacac atatacagga 1440
aagaccatgc gaagaagaca cagagaggaa acggccatct acaatccaag gagagaggcc 1500
tcagaacaag ccaaccctgc agataccttg atctaggcat ccagaattgt atgaaaatac 1560
atttttgtta tttaagctac ccagtctgtg gtgctttgtt atggcaatcc cagaaaacta 1620
attaactaat tgactgcagc cttgcaaggg accctgaacc agaggaccca gttaagccat 1680
gtgcagattc ctgactcaca gaactaggag ataacaaatg tttgttgttc taaggtgcta 1740
aatgttggga aaatatgtta tgtggctatg ttgttttgca catgtgccac ataatgatgt 1800
tttggtcaac aacagaccac acatatgatg gtgggcccat aagattgtaa tactgtgttt 1860
ctattatgca ttctctattt atagatacac aaatacttgc cactgtgtta caattgccta 1920
ctgtcttcag tactgtaaca tgctgtacag gtttgtagcc cagaagcaat agactatacc 1980
atacagtca 1989



<210> 10 

<211> 3931 

<212> RNA 

<213> Homo sapiens 



<220> 

<221> mRNA 
<222> (0)...(0)

<223> Corresponds to a cDNA detected partly by cDNA
cloning, partly by PCR analysis. It includes two
exons, the last is identical to the last exon in
Seq_ID_06 and Seq_ID_07.doc. It is transcribed
from the +strand of Seq_ID_06.doc



<400> 10 

gaaggtaaaa aaaagaggga ggtctgagaa atagaaatat cagaggaagg aaataaagga 60
gggtgagagt aaattctctt ttagcattca gattccacag attccacaaa tcacatttct 120
ttttttacca actaaggaaa aataacactt gacctaacat ttcattgcag ttagctaaag 180
gatgctagaa aaactatgtt gcagtggttt gctctaattt cttcaggaat agagaaaagt 240
gacaaaaaga tcagagaaga gaagaaagga aactatcaga aaaatacaga attggagtag 300
gatataacat atttgggttg aaggtaaaat tttatattgt aatcttaagt atcttgctac 360
ttcagtttgg tccctggaac agcagcatca gaatctgccg agggcttgtt aaaaaggcag 420
aatctcaggt cccatcccag actcactgaa tcagaatata aatactgaca agatgccccg 480
ggattcatat gcacagtaga gctggcgaag ttccattgta gcctgtgatt gttttctgca 540
acttagtatt tctgagtttt cccaaggaag aaaacccagg ccttagcttc tggcagactt 600
gtgtttctcc tttacttact agctgcatga ctcatgagca aggaaatcaa actttatgtg 660
cctgagtttc ctcatctata aaatggagac tataataatc atctcctagg cttgttttga 720
ggatgttcaa caaatgctcc tttcattcct ctatttacag acctgccgca gacaattctg 780
ctagcagcct ttgtgctatt atctgttttc taaacttagt aattgagtgt gatctggaga 840
ctaactctga aataaataag ctgattattt atttattttc tcaaaacaac agaatacgat 900
ttagcaaatt acttcttaag atattatttt acatttctat attctcctac cctgagttga 960
tgtgtgagca atatgtcact ttcataaagc caggtataca ttatggacag gtaagtaaaa 1020
aacatattat ttattctacg tttttgtcca aaaattttaa atttcaactg ttgcgcgtgt 1080
gttggtaatg taaaacaaac tcagtacagt agtattcagt acagtattta agcccctgta 1140
cttaaacata ttcctcgtac caatgaagtt acatgaaaag caaatttgtg tgagatatcg 1200
tagatggaag taaattagtc tttatgttcc ccacaaattg aaatgcattt caaaaactct 1260
gtgtgtgtat gtgtgtgtgt gacagagtgt gtgtgagaga gagacagaga gatacgcttt 1320
ggttgcctcc ataagctggc tgctatgatt aataagacca agttttctaa agaaaatgag 1380
atcataacaa aagccctctt tatgactatc ttttatcagg ggcaaaaagg aaagagacaa 1440
aacagcatga aatgatgaga ccaagtgatg aaaattcatt cacaatgatt gctttcaaga 1500
gtaatttctc ttgggtaatt cagcagcctg ttactatggc tctctggagt gatagctaat 1560
gtaaatgaag cctctaaaag tggattatcc tgacaagaat atactcagcc aataatgcaa 1620
cagaaatcca ttcaaagcat tcgggaaaaa ttcaaaagaa taaatattct tttttttttt 1680
ttaaagttaa tgacctacga tccatttctt ccctgactaa caagcagcaa gcacttaaaa 1740
atatccagcc aggatgaaat agaaacccac ctgacttgtt aatatttttg tttggtccca 1800
gggactcaga ttctaagcca aattctttga atgatcttgg caaatgtctc gaattatttt 1860
tgccaacttt tctttatctt ggaaaaaaag tttcatgaat gggtgtcaaa attgattagt 1920
tttaaaaacc tttcttgcag atacgtatgg caccctaaaa ctgtattaga aaaaaatttc 1980
atatctccag gcctttcatt gggtcaggtt ggcatttcgc tgccctttat gtgtgtgaca 2040
agtgaaaata aggaaagaaa aaaactcaag tgaagaaaat cagaatctgc gcagcagttc 2100
ctgggcgttt cagctgcttc ccacatcacc tgcctcatca agccccagca tccatctcct 2160
tgctcatctt acaccctgtg tgcatgacag gcccaccatt catttatcag agcaaaggct 2220
ctcccactat tctggttcac ccccctactt agccagatat acaagaatat ctgcacggat 2280
gacctgcctc acctgggagc tcagaggagc tcagattcca ttactatcgc accaaggaca 2340
gatctcccag caagaatgac agaaaagact aactgccccc aaaatctccc ttccaaaaca 2400
cagttctctt aattctccca agaaaccaga atgtgactgc tcacctctct aaggacctga 2460
aaacaactgg ccatttcagc tatttaaatc aactttaaaa aatccaaccg ccaaaatatt 2520
aaaccatttt ggttggaatg ataacataac taacctgctg acagctgctt ctgctaggtg 2580
caaaaatgga aaaaaaaata cttctaatca ggtcaaatca ctctaccttt gggattctaa 2640
atttactcat attctcaaag aaatatattc agtcatagtg gggaaaatag gattattcct 2700
ttagctcgat aagcaaccag aagttcttcc ttcaaatctt gacatttaat caatcagaaa 2760
ttgatttttg gaaaactgtt tcctatgaag ctatctctgc ctgaaggatt tttcttttac 2820
aatccagact atagaaggaa attcacaacc tggactttca cctccattgg tcagagtttt 2880
actgaccaat tcccacctct gccttacacc taacggaagt ttatgcctgt tttctcttca 2940
cataccccaa cagttacaaa tggttgttat tattaagcat cttttatttt gtggcctctg 3000
attacatggt cccctaaatt ttgacctaat cacaaaagat tggtaaaatt tcttaacata 3060
ttaataatat tttgtttatg tgtcaatatc ttagcatgta tcaattaaga cagaggtctt 3120
aacgttctct ttttgaaaga gaatattagg attcagagat attaagagat tctcccagga 3180
tcacagttag gtaacagagc tggattttag tccaggtctg tctacagctc taacgtatat 3240
acaccctttg tataacatgt cacgaattca gcataaaggg atcttcagtg atctaagtca 3300
ggggtcagca accttttcta aaaaggacca aatagtaata tttcaggctt tgtggaccct 3360
atggtctcta tcataactgt tcaaatcacc atgtagtgta aaaggagcca taagcaaaat 3420
ataaactaac gaatgtggct gttttatggg attttttttt aactctttat ttacaaaagc 3480
aggtggcaga tcagaactca cttatgggcc atagttctct gacccctgac ctgagaaaat 3540
cttatattta tggacaacat ttagactgtg acttgccaag taagaacaag aagctctgtc 3600
aactgaaggt caaggctgga gttctgaaag caaagagctg tctggtgtta atgataagtg 3660
aaatagttaa agttagaaga tcccagttat aagaagcaca aagaataatg accatagact 3720
cctgaacaag aatgtctgga cttctggctt aggcactctt gttgtatggt ccaggccaag 3780
ttacctaatc tctccaggcc tccattttct tatcattaaa tgaagataat aaaagtattt 3840
tcctcagaga gctgtaagaa taaactgagc taacccatgt caagcacata gaatagggcc 3900
cagcctatat taatttatca ataaatgcca g 3931



<210> 11 
<211> 9458 
<212> RNA
<213> Homo sapiens 

<220> 

<221> mRNA
<222> (0)...(0)

<223> Corresponds to a cDNA detected by cDNA cloning. It 
includes one exon. The sequence includes the last 
two exons in Seq_ID_07.doc. It is transcribed from 
the +strand of Seq_ID_06.doc 



<400> 11 

atttgaattg gtgaacttag taaagcagac ggctctcacc aataagggca ggcatcatcc 60
aatctgtcga aagcttgaat aaaacaaaaa gaggaaggga aaatttgctt cttttcttct 120
tgatctagta tatcatcttc tcctgccctt ggatgtgagt gggccttcag acttaaacca 180
ggagttacac ctttggcttc cctggttctc agttctttgg acttggactg aattacactg 240
ccaggtttcc tggttctcca gcttgcagat ggcagatcat gggacttctt ggcctccata 300
attgtgtgag tcaatttcca ttttatttac atatccagtt atgcattgct taacaatgga 360
gacaggttct gagaaatgca ttgttaagtg atttcatcat tgtgcaaaca tcatagagtg 420
taactacaca aacctggaca gcatagacta ctacacatct aggctacatg gtgtagcttg 480
taacctcatg ataagtatgt ataacatcat gataagtatg tatgtatcta ccatatctaa 540

atgtagaaaa
atcctttgta 
acacacacaa 
catttatatt 
cataggtgtg 
acaagcccag 
acctcccaca 
catttatgga 
atttctacca 
actaggctgc 
ggtataatta 
tgaaaaaaca 
aaatatataa 
attcaacatg 
tcaggtaaat 
tcaaatcaca 
aaccaaaatt 
aaattttgaa 
cacaatttta 
tttcttcttt 
attattactg 
ggagttaaca 
tgcctaagac 
acatgccacc 
tttaattatg 
acttttgtca 
ttaaagagga 
tcaccctagg 
tgcagctttg 
ataaaatgag 
aatgcagagt 
aaaaaaaaaa 
agaaatatca 
ttccacagat 
cctaacattt 
tctaatttct 
ctatcagaaa 
tatattgtaa 
atctgccgag 
agaatataaa 
ccattgtagc 
aacccaggcc 
catgagcaag 
taataatcat 
atttacagac 
aacttagtaa 
ttattttctc 
atttctatat 
ggtatacatt 
aattttaaat 
tattcagtac 
atgaaaagca 
acaaattgaa 
gtgagagaga 
taagaccaag 
ttatcagggg 
aattcattca 
actatggctc 
acaagaatat 
caaaagaata 
ctgactaaca 



ggtacagtaa 
gactgaaatg 
atatactatt 
gactctattt 
tgaactggga 
ggcatcagct 
agacctctca 
taaaataggg 
ggcagtgggg 
actgtctgta 
ttgccatgca 
ttttcttggt 
atactttcaa 
aaaatgatta 
acttagggta 
gattcaaaat 
gcttgcttct 
tagtaaaaca 
agtacatgaa 
ttaaaactgg 
attattacag 
aatgatccac 
agagctggct 
atcctcagtt 
atatagctga 
aatttatttt 
gaaactgagc 
ctattaactc 
actatgcctg 
ccatagggct 
ttaaaatatt 
aaaaaaaaaa 
gaggaaggaa 
tccacaaatc 
cattgcagtt 
tcaggaatag 
aatacagaat 
tcttaagtat 
ggcttgttaa 
tactgacaag 
ctgtgattgt 
ttagcttctg 
gaaatcaaac 
ctcctaggct 
ctgccgcaga 
ttgagtgtga 
aaaacaacag 
tctcctaccc 
atggacaggt 
ttcaactgtt 
agtatttaag 
aatttgtgtg 
atgcatttca 
gacagagaga 
ttttctaaag 
caaaaaggaa 
caatgattgc 
tctggagtga 
actcagccaa 
aatattcttt 
agcagcaagc 



aaatatggta 
tcattgtgta 
ggttcttttt 
caaaatttat 
tagtgtcctg 
tatagatatc 
gaagttgaga 
attagcagga 
ccccaagtgc 
gagaatctga 
tcataattct 
ttgaattcta 
gtttgcatat 
tgggaaacat 
tttcatggcc 
caagaatgat 
gtcattctat 
gagtgtcaac 
gtatttttac 
tatttgttat 
gattattact 
tctgggtgtt 
cagcttgctg 
gtccagatag 
caccattcaa 
tcataaataa 
tcctaaagtt 
aggcagtcta 
aattataacg 
caatttcata 
tttgtaaaag 
aaaaggaaga 
ataaaggagg 
acatttcttt 
agctaaagga 
agaaaagtga 
tggagtagga 
cttgctactt 
aaaggcagaa 
atgccccggg 
tttctgcaac 
gcagacttgt 
tttatgtgcc 
tgttttgagg 
caattctgct 
tctggagact 
aatacgattt 
tgagttgatg 
aagtaaaaaa 
gcgcgtgtgt 
cccctgtact 
agatatcgta 
aaaactctgt 
tacgctttgg 
aaaatgagat 
agagacaaaa 
tttcaagagt 
tagctaatgt 
taatgcaaca 
tttttttttt 
acttaaaaat 



taatcttatg 
gtgcatgact 
ctctgaagag 
ggttttggtg 
ctgatgaatg 
aagagctcaa 
aactgctaaa 
ttctttaaat 
agggccatag 
aaaaataata 
aaatcatact 
aataattgtt 
ttttattacc 
tcgggtatac 
ctagtctttg 
aacacagtga 
ggaaccactg 
ttcatgctgg 
aagactaagt 
aaaactaaag 
gaaaaagaaa 
gaaaacacca 
ggtcacagca 
ataatcccat 
agcactatgc 
cccaaatatg 
taaatatcta 
actcaggtat 
tcatgctatc 
aaaggagaga 
tgccagagat 
aggtaaaaaa 
gtgagagtaa 
ttttaccaac 
tgctagaaaa 
caaaaagatc 
tataacatat 
cagtttggtc 
tctcaggtcc 
attcatatgc 
ttagtatttc 
gtttctcctt 
tgagtttcct 
atgttcaaca 
agcagccttt 
aactctgaaa 
agcaaattac 
tgtgagcaat 
catattattt 
tggtaatgta 
taaacatatt 
gatggaagta 
gtgtgtatgt 
ttgcctccat 
cataacaaaa 
cagcatgaaa 
aatttctctt 
aaatgaagcc 
gaaatccatt 
aaagttaatg 
atccagccag 



ggatcaccat 
gtatacgcac 
ccctaataca 
aaacatatgt 
ggtgggaggc 
caagagcact 
agtttcttta 
actttcgaac 
gaagtacaag 
gagtcactga 
agtggtcaaa 
gtggtcacca 
tgttccttaa 
agtccctgac 
gggtaccaca 
ttgtgtagac 
agagttttta 
aatatttttg 
aacatcactg 
agcgaatcaa 
tgtacggaat 
ataagcctgc 
tgtaaggaaa 
agccccatgg 
taagtccttt 
tataccacta 
acccaagtta 
aataacatta 
taactaaaaa 
aaatactggg 
tgagtataac 
aagagggagg 
attctctttt 
taaggaaaaa 
actatgttgc 
agagaagaga 
ttgggttgaa 
cctggaacag 
catcccagac 
acagtagagc 
tgagttttcc 
tacttactag 
catctataaa 
aatgctcctt 
gtgctattat 
taaataagct 
ttcttaagat 
atgtcacttt 
attctacgtt 
aaacaaactc 
cctcgtacca 
aattagtctt 
gtgtgtgtga 
aagctggctg 
gccctcttta 
tgatgagacc 
gggtaattca 
tctaaaagtg 
caaagcattc 
acctacgatc 
gatgaaatag 



catatatgca 
acatacacaa 
atatgttata 
ggagatgggg 
atcatttggg 
ttatggcaaa 
tgacagatga 
actaaccttc 
tctgggagat 
aatgcagttt 
tactcttccc 
ctgagctttt 
caaacattga 
tcttaaggac 
tgtttcttct 
aaaataagtg 
cttgtgctta 
gctttttaga 
aaattacagc 
gaaaagcata 
agaggaggaa 
ttccaggaag 
ctgctgggct 
ggaaataatc 
atgtgaatta 
ttatcctacc 
agactgctag 
tgctactgtt 
gctaagggaa 
gaaaagtgat 
aagtgtgacc 
tctgagaaat 
agcattcaga 
taacacttga 
agtggtttgc 
agaaaggaaa 
ggtaaaattt 
cagcatcaga 
tcactgaatc 
tggcgaagtt 
caaggaagaa 
Gtgcatgact 
atggagacta 
tcattcctct 
ctgttttcta 
gattatttat 
attattttac 
cataaagcca 
tttgtccaaa 
agtacagtag 
atgaagttac 
tatgttcccc 
cagagtgtgt 
ctatgattaa 
tgactatctt 
aagtgatgaa 
gcagcctgtt 
gattatcctg 
gggaaaaatt 
catttcttcc 
aaacccacct 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 
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gacttgttaa tatttttgtt tggtcccagg 
gatcttggca aatgtctcga attatttttg 
tcatgaatgg gtgtcaaaat tgattagttt 
ccctaaaact gtattagaaa aaaagtaagt 
accctctttt acaaactcac aaaaacagcc 
tattgctttc tatataccta catcttgtct 
ggtggctcac acctgtaatt ccagcacttt 
tcaggagttc aggaccagcc tgatcaatat 
aaatcacccg ggtgtggtga cgggcgcctg 
gagaatcact tgaactcagg agtcagaggt 
ccagcctggg tgacagagca agactccatc 
tcaacagcta ttcccacccc tctgcatgga 
ttgggtaatg gccctctggg caggactgga 
gtttagaagc atgtctggga aatgtcatgc 
catgaatgga aaaggagagt aattctatgt 
agcaaaagtg gggaagcaag aggaattatg 
gattggctca gtcatcttgg ctgaggctca 
ttaatttcat ccattacagg aagaggagcc 
tgatggtaga aggtattttg ggactattcc 
tgaattcctt tcaagcaaag gattaaattt 
aagtattgtt gctctgctca gagttttatc 
aaatgacata aatgaggttt tttattgttg 
tagctacctg ggcagagctg ttttatttct 
ccatggaagg cagtcattaa gatgttccca 
ctgcatcctt ccctgtccct caattccatt 
ctcatgaagt tctctctcat ttattaaaat 
ccagtaatgc ctgattggcc ccttatccta 
tgagaaatct tgcaaatcat tgagccaaaa 
gactagtatg tggcaggcag tgccctttta 
gggttcacac atcttaaaga ggtgctatct 
gtaaggaatt gtctttgttt ggttatattc 
aagaaacctt gggatcattt tagactaaga 
cgcctgtaat cccagcactt tgagaggccg 
agaccgtcct ggctaacgtg gtgaaaccct 
ggcgtggtgg cgggcgcctg tagtcccagc 
tgaactcagg aggcggagct tgcagtcagc 
cgacagaacg agactccgtc tcagaacaaa 
agagccaaat catgaacata agctaaacaa 
agcagactct taactagaac . actgcactcc 
ttctaattaa cccctgttac ttaccagtgg 
gccccaattt gttcatctgt agaaggggta 
actgtgagca ttaaatgagt tactactgta 
agagtttgtt aaacacttaa gaactgattt 
ctggaaatga tcaagcatag gccctggaat 
tcgtttcttt tgttcagccc tgtgcctaga 
ttgttgaatg aatcaatgaa cctaccgagg 
aaatctgggc aaagtgacct ccaaggaaat 
ggaactgata aattggtgtg atacaatgta 
cttattttct ctccctcttt tttccttcct 
ctcccttcct tcctccctcc ctcccttcct 
tctttctttc tttctttctt tctttctttc 
cttcctttct ttgtgccttt ctttctttct 
ttctttcttt ctgcctttct ttctctttgt 
gcagaccatg tctgttagat gaatgccttt 
agcacaatta tcaagggtct ccagtcatct 
cagtttcata tctccaggcc tttcattggg 
tgtgacaagt gaaaataagg aaagaaaaaa 
gcagttcctg ggcgtttcag ctgcttccca 
atctccttgc tcatcttaca ccctgtgtgc 
aaaggctctc ccactattct ggttcacccc 
cacggatgac ctgcctcacc tgggagctca 



gactcagatt ctaagccaaa ttctttgaat 4260 
ccaacttttc tttatcttgg aaaaaaagtt 4320 
taaaaacctt tcttgcagat acgtatggca 4380 
actctgtagt gtgaaaaatt cttaaaggac 4440 
tttggaatac ccacatgaag tagctgttgt 4500 
attataaaaa gactggtttt tggcaggtgt 4560 
gggaggccaa ggcgggcgga tcacctgaga 4 620 
ggtgaaaccc agtctttact gaaaatacaa 4 680 
tagtcccagc tactcgggta gctgaggcag 4740 
tgcagtgagc tgagatcatg ccactgcact 4800 
tcaaaaaaaa aaaaaaaaaa gactggtttt 4860 
aatattcacc cagtcaattg ttttcctagt 4 920 
gtggggcaca caggagaagc tgcaaactat 4 980 
aagaaaagac atatttaaag gtaggctttg 504 0 
agagcagagc ctcttacttg cagtgagaga 5100 
cttttcatca gccaaatttg caggtaggag 5160 
tgaaaccagg tgtaaagaaa gtggactaga . 5220 
gtgaaagata atccagaaat cattgggatt 5280 
atttgaaatg agaaggtacc tgacattctt 5340 
acccatgagt tgactcagaa aaaacataaa 5400 
taactcattc tcacttctta ttccatgatg 54 60 
ttgttgttgt tttctggaca caaggcaagg 5520 
ctatgccgtg gagagaaatt ggttaattgg 5580 
tgcgagtgaa ctttccaggg ttcccagctt 5640 
gttggtgatg acaatgtctc tcccatcagc 5700 
ttgctttcag gaaaaatttt gaaaatgtgt 57 60 
aaggcttaaa ctggaggaag gaagctaaac 5820 
acgtattaat agcaagatct atcatttatt 5880 
tttaggcagg gagagttgat ggggggggcg 5940 
cctcctatat aaatcatgta agtcaagaga 6000 
aggggattag agtatacagt agaagatccc 6060 
aatgccaata ccgccgggcg cggtggctca 6120 
aggtgggcgg atcacaaggt caggagattg 6180 
gtctctacta aaaatacaaa aaattagccg 6240 
tactcgggag gcggaggcag gagaatggtg 6300 
cgagattgcc ccaatgcact ccagcctggg 6360 
acaaaaggaa atgccaatac cagcagaaat 6420 
atgttggcag tgtagcctag tggttaagag 6480 
atgtcctcac tgtagaccct cactgtgggg 6540 
cagtcttaag gcattcctta agttcgttgt 6600 
ggatgacagt agtgtttact ttataggctt 6660 
tttgtaaagt gcttaaaatg ctgctccaaa 6720 
acttgcatct aaactgacag ctctcaataa 6780 
ataagcaggt ctacatgaag gcaaaaatgt 6840 
tcaatatcta gtgatcatgc tcaagaaata 6900 
tagttacata aaagagttct gcatgagtac 6960 
ttccactttt agattctgtg atttccttaa 7020 
aaaaaatgtg cctatatgat ttgagaaaaa 7080 
tccttcctcc ctcccttcct tccttcctcc 7140 
tccttctctt tcttcttttc tttctttctt 7200 
tttctttctt ctcttctctt tcctttcttt 7260 
ttctttcttc tctgtccttt ctttcttcct 7320 
ttttctttct ttccttcttt tttcctttaa 7380 
ttctagttaa aaggttaaac aggaaagtga 7440 
ccacatgttc ttaatcatta tcttctttta 7500 
tcaggttggc atttcgctgc cctttatgtg 7560 
actcaagtga agaaaatcag aatctgcgca 7620 
catcacctgc ctcatcaagc cccagcatcc 7680 
atgacaggcc caccattcat ttatcagagc 7740 
cctacttagc cagatataca agaatatctg 7800 
gaggagctca gattccatta ctatcgcacc 7860 
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aaggacagat ctcccagcaa gaatgacaga 
caaaacacag ttctcttaat tctcccaaga 
gacctgaaaa caactggcca tttcagctat 
aaatattaaa ccattttggt tggaatgata 
ctaggtgcaa aaatggaaaa aaaaatactt 
attctaaatt tactcatatt ctcaaagaaa 
tattccttta gctcgataag caaccagaag 
tcagaaattg atttttggaa aactgtttcc 
cttttacaat ccagactata gaaggaaatt 
gagttttact gaccaattcc cacctctgcc 
ctcttcacat accccaacag ttacaaatgg 
gcctctgatt acatggtccc ctaaattttg 
taacatatta ataatatttt gtttatgtgt 
aggtcttaac gttctctttt tgaaagagaa 
cccaggatca cagttaggta acagagctgg 
cgtatataca ccctttgtat aacatgtcac 
taagtcaggg gtcagcaacc ttttctaaaa 
ggaccctatg gtctctatca taactgttca 
gcaaaatata aactaacgaa tgtggctgtt 
caaaagcagg tggcagatca gaactcactt 
agaaaatctt atatttatgg acaacattta 
ctctgtcaac tgaaggtcaa ggctggagtt 
ataagtgaaa tagttaaagt tagaagatcc 
atagactcct gaacaagaat gtctggactt 
ggccaagtta cctaatctct ccaggcctcc 
agtattttcc tcagagagct gtaagaataa 
tagggcccag cctatattaa tttatcaata 



aaagactaac tgcccccaaa atctcccttc 7920 
aaccagaatg tgactgctca cctctctaag 7980 
ttaaatcaac tttaaaaaat ccaaccgcca 8040 
acataactaa cctgctgaca gctgcttctg 8100 
ctaatcaggt caaatcactc tacctttggg 8160 
tatattcagt catagtgggg aaaataggat 8220 
ttcttccttc aaatcttgac atttaatcaa 8280 
tatgaagcta tctctgcctg aaggattttt 8340 
cacaacctgg actttcacct ccattggtca 8400 
ttacacctaa cggaagttta tgcctgtttt 84 60 
ttgttattat taagcatctt ttattttgtg 8520 
acctaatcac aaaagattgg taaaatttct 8580 
caatatctta gcatgtatca attaagacag 8640 
tattaggatt cagagatatt aagagattct 8700 
attttagtcc aggtctgtct acagctctaa 87 60 
gaattcagca taaagggatc ttcagtgatc 8820 
aggaccaaat agtaatattt caggctttgt 8880 
aatcaccatg tagtgtaaaa ggagccataa 8940 
ttatgggatt tttttttaac tctttattta 9000 
atgggccata gttctctgac ccctgacctg 9060 
gactgtgact tgccaagtaa gaacaagaag 9120 
ctgaaagcaa agagctgtct ggtgttaatg 9180 
cagttataag aagcacaaag aataatgacc 924 0 
ctggcttagg cactcttgtt gtatggtcca 9300 
attttcttat cattaaatga agataataaa 9360 
actgagctaa cccatgtcaa gcacatagaa 9420 
aatgccag 9458 



<210> 12 

<211> 552 

<212> DNA 

<213> Homo sapiens

<220> 

<221> exon 
<222> (0) . . . (0) 
<223> Exon 1 

<400> 12 

attatgtaga accattccaa tcctgagtga caggatttgc aaagacagct gttgggctcc 60
acaaacattt ggaggagggg aaaatgtgag gttggttgct acattctcct acttcttttt 120
actgaaagga acagctgcga tcttcacatg taagatgaag taaacaaaag ctgacaatgc 180
ccagcaccat tcagcagtaa gcattcaaat ggatttttcc tcaccgcgtt tgcaggcccg 240
gcacagtctc agttccccac agcacagaga ggaagcaagg tctttgcatg ccctgttcac 300
ttggccaaac attgccctag atgtgcaatt caacccatat aaagcaagcg taattcaaga 360
gccctgactc tttcaccatt tcatgatggc ttattactcg cacacggcat tgtcaggttc 420
cagcaactct gtgtgcagca ggcagaaatg cagtgtttat ctctgacagg caaacattct 480
taggctcatc ggcatgagta tccacgctga actggctatc ggggaagtag atccctgcat 540
gggaaagtca ag 552

<210> 13
<211> 305
<212> DNA
<213> Homo sapiens

<220>
<221> exon
<222> (0)...(0)
<223> Exon 1a

<400> 13

atttgaattg gtgaacttag taaagcagac ggctctcacc aataagggca ggcatcatcc 60 
aatctgtcga aagcttgaat aaaacaaaaa gaggaaggga aaatttgctt cttttcttct 120 
tgatctagta tatcatcttc tcctgccctt ggatgtgagt gggccttcag acttaaacca 180 
ggagttacac ctttggcttc cctggttctc agttctttgg acttggactg aattacactg 240 
ccaggtttcc tggttctcca gcttgcagat ggcagatcat gggacttctt ggcctccata 300 
attgt 305 

<210> 14
<211> 307
<212> DNA
<213> Homo sapiens

<220>
<221> exon
<222> (0)...(0)
<223> Exon 1b

<400> 14

atttgaattg gtgaacttag taaagcagac ggctctcacc aataagggca ggcatcatcc 60
aatctgtcga aagcttgaat aaaacaaaaa gaggaaggga aaatttgctt cttttcttct 120
tgatctagta tatcatcttc tcctgccctt ggatgtgagt gggccttcag acttaaacca 180
ggagttacac ctttggcttc cctggttctc agttctttgg acttggactg aattacactg 240
ccaggtttcc tggttctcca gcttgcagat ggcagatcat gggacttctt ggcctccata 300
attgtgt 307

<210> 15
<211> 557
<212> DNA
<213> Homo sapiens

<220>
<221> exon
<222> (0)...(0)
<223> Exon 2

<400> 15

gggcaaaaag gaaagagaca aaacagcatg aaatgatgag accaagtgat gaaaattcat 60 
tcacaatgat tgctttcaag agtaatttct cttgggtaat tcagcagcct gttactatgg 120 
ctctctggag tgatagctaa tgtaaatgaa gcctctaaaa gtggattatc ctgacaagaa 180 
tatactcagc caataatgca acagaaatcc attcaaagca ttcgggaaaa attcaaaaga 240 
ataaatattc tttttttttt tttaaagtta atgacctacg atccatttct tccctgacta 300 
acaagcagca agcacttaaa aatatccagc caggatgaaa tagaaaccca cctgacttgt 360 
taatattttt gtttggtccc agggactcag attctaagcc aaattctttg aatgatcttg 420 
gcaaatgtct cgaattattt ttgccaactt ttctttatct tggaaaaaaa gtttcatgaa 480 
tgggtgtcaa aattgattag ttttaaaaac ctttcttgca gatacgtatg gcaccctaaa 540 
actgtattag aaaaaaa 557 

<210> 16 
<211> 1955 
<212> DNA 

<213> Homo sapiens

<220> 

<221> exon 
<222> (0) . . . (0) 
<223> Exon 3 

<400> 16 

tttcatatct ccaggccttt cattgggtca ggttggcatt tcgctgccct ttatgtgtgt 60
gacaagtgaa aataaggaaa gaaaaaaact caagtgaaga aaatcagaat ctgcgcagca 120
gttcctgggc gtttcagctg cttcccacat cacctgcctc atcaagcccc agcatccatc 180
tccttgctca tcttacaccc tgtgtgcatg acaggcccac cattcattta tcagagcaaa 240
ggctctccca ctattctggt tcacccccct acttagccag atatacaaga atatctgcac 300
ggatgacctg cctcacctgg gagctcagag gagctcagat tccattacta tcgcaccaag 360
gacagatctc ccagcaagaa tgacagaaaa gactaactgc ccccaaaatc tcccttccaa 420
aacacagttc tcttaattct cccaagaaac cagaatgtga ctgctcacct ctctaaggac 480
ctgaaaacaa ctggccattt cagctattta aatcaacttt aaaaaatcca accgccaaaa 540
tattaaacca ttttggttgg aatgataaca taactaacct gctgacagct gcttctgcta 600
ggtgcaaaaa tggaaaaaaa aatacttcta atcaggtcaa atcactctac ctttgggatt 660
ctaaatttac tcatattctc aaagaaatat attcagtcat agtggggaaa ataggattat 720
tcctttagct cgataagcaa ccagaagttc ttccttcaaa tcttgacatt taatcaatca 780
gaaattgatt tttggaaaac tgtttcctat gaagctatct ctgcctgaag gatttttctt 840
ttacaatcca gactatagaa ggaaattcac aacctggact ttcacctcca ttggtcagag 900
ttttactgac caattcccac ctctgcctta cacctaacgg aagtttatgc ctgttttctc 960
ttcacatacc ccaacagtta caaatggttg ttattattaa gcatctttta ttttgtggcc 1020
tctgattaca tggtccccta aattttgacc taatcacaaa agattggtaa aatttcttaa 1080
catattaata atattttgtt tatgtgtcaa tatcttagca tgtatcaatt aagacagagg 1140
tcttaacgtt ctctttttga aagagaatat taggattcag agatattaag agattctccc 1200
aggatcacag ttaggtaaca gagctggatt ttagtccagg tctgtctaca gctctaacgt 1260
atatacaccc tttgtataac atgtcacgaa ttcagcataa agggatcttc agtgatctaa 1320
gtcaggggtc agcaaccttt tctaaaaagg accaaatagt aatatttcag gctttgtgga 1380
ccctatggtc tctatcataa ctgttcaaat caccatgtag tgtaaaagga gccataagca 1440
aaatataaac taacgaatgt ggctgtttta tgggattttt ttttaactct ttatttacaa 1500
aagcaggtgg cagatcagaa ctcacttatg ggccatagtt ctctgacccc tgacctgaga 1560
aaatcttata tttatggaca acatttagac tgtgacttgc caagtaagaa caagaagctc 1620
tgtcaactga aggtcaaggc tggagttctg aaagcaaaga gctgtctggt gttaatgata 1680
agtgaaatag ttaaagttag aagatcccag ttataagaag cacaaagaat aatgaccata 1740
gactcctgaa caagaatgtc tggacttctg gcttaggcac tcttgttgta tggtccaggc 1800
caagttacct aatctctcca ggcctccatt ttcttatcat taaatgaaga taataaaagt 1860
attttcctca gagagctgta agaataaact gagctaaccc atgtcaagca catagaatag 1920
ggcccagcct atattaattt atcaataaat gccag 1955

<210> 17
<211> 366
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> (0)...(0)

<400> 17

atgttcaaca aatgctcctt tcattcctct atttacagac ctgccgcaga caattctgct 60
agcagccttt gtgctattat ctgttttcta aacttagtaa ttgagtgtga tctggagact 120
aactctgaaa taaataagct gattatttat ttattttctc aaaacaacag aatacgattt 180
agcaaattac ttcttaagat attattttac atttctatat tctcctaccc tgagttgatg 240
tgtgagcaat atgtcacttt cataaagcca ggtatacatt atggacaggt aagtaaaaaa 300
catattattt attctacgtt tttgtccaaa aattttaaat ttcaactgtt gcgcgtgtgt 360
tggtaa 366



<210> 18 

<211> 1253 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> exon

<222> (0)...(0)

<223> ESTbc03936 Exon 2
<400> 18

tatccccact gtgtagagct ggtccacaat ggagaggagg aaaagttttg cattggacca 60 
aatcactgcc ttagaaggaa agagattgca atcccattca tttttaagat ctgcaatggg 120 
ttgggatgat acccatttaa aaaagaaaaa aaaagaggtt acctctttgg cctctttggc 180 
aaaggatatg aactagtaat aaagcctcag catcaaggct tgggagtctg gctcaaagtg 240 
gctaagapta gaagcatctt ctaacataaa tatggcagca ttaggaaaat agcttggcct 300 
ttagagagac ccagattatc ttcaccttaa aaactaccaa gaccctggtc cagcaagacc 360 
accatcacag cagtaaatac atagccacca acttcacaga aggttttcca ctgagacatt 420 
gtgaaaacca caatccatta ccggtggatg agtcccagct ctcttactac ctggagcttc 480 
caagcaagac ttacctgtcc tttgttccag attggttcct gggatcctgg accaacacaa 540 
agagaagagg aaagcctcaa gggaaataaa agtgaaggtt tcagtgatag tttaaatata 600 
tatttttaaa tgttagcttt tttttaaata cttcacggtg atggatggag tatttcaacc 660 
acaatggcaa agaatgaatt accggcagtg tcagagtttc ttgttgctgt gggaagacag 720 
ctgccatgtt gagaggtgcc ttgtggagag gctcatgtgg cgaggaactg agactggctt 780 
ctatccaaca accagtgagg aactaaggcc ctaaagtcct atagtccaca aggaactaaa 840 
tcctaccaac aactacatga gtgagcctgg aatctgattt tccccagtgg ggacttaagg 900 
tacctacaac ctggctgtta tgggctaaac tgcatcctgc ccaaatttat atgtttgagt 960 
cttaactccc agcacttcag aacatggttt tgtttggaga catagtcatt aaagaggtaa 1020 
tgaagttaaa atgaggtcat taggatgagt cctaatccaa taggactggt gttcttatat 1080 
gaagaggaaa tttgaacaca gacacatata caggaaagac catgcgaaga agacacagag 1140 
aggaaacggc catctacaat ccaaggagag aggcctcaga acaagccaac cctgcagata 1200 
ccttgatcta ggcatccaga attgtatgaa aatacatttt tgttatttaa gct 1253

<210> 19 

<211> 21 

<212> RNA 

<213> Homo sapiens 

<220> 

<221> mRNA 

<222> (0) . . . (0) 

<223> siRNA targeting Exon 1 

<400> 19 

aauaagggca ggcaucaucc a 21

<210> 20 

<211> 21 

<212> RNA 

<213> Homo sapiens 

<220> 

<221> mRNA 
<222> (0) ... (0) 

<223> siRNA targeting Exon 1 B
<400> 20 

aauuacacug ccagguuucc u 21 

<210> 21 

<211> 21 

<212> RNA 

<213> Homo sapiens 

<220> 

<221> mRNA 
<222> (0) . . . (0) 

<223> siRNA targeting Exon 2 A 



<400> 21 

aauucauuca caaugauugc u 21

<210> 22

<211> 21 

<212> RNA 

<213> Homo sapiens 

<220> 

<221> mRNA
<222> (0)...(0)

<223> siRNA targeting Exon 2 B 
<400> 22 

aauuucucuu ggguaauuca g 21 

<210> 23 

<211> 21 

<212> RNA 

<213> Homo sapiens 

<220> 

<221> mRNA 
<222> (0) . . . (0) 

<223> siRNA targeting Exon 3 (DDend) A 
<400> 23 

aaaaucagaa ucugcgcagc a 21 

<210> 24 

<211> 21 

<212> RNA 

<213> Homo sapiens 

<220> 

<221> mRNA 
<222> (0)...(0) 

<223> siRNA targeting Exon 3 (DDend) B 
<400> 24 

aaugaugaug ggaagaagga a 21

<210> 25 

<211> 21 

<212> RNA 

<213> Homo sapiens 

<220> 

<221> mRNA 

<222> (0)...(0) 

<223> siRNA targeting CDS A 



<400> 25 

aaacuuagua auugagugug a 21

<210> 26 

<211> 21 

<212> RNA 

<213> Homo sapiens 

<220> 

<221> mRNA
<222> (0)...(0)

<223> siRNA targeting CDS B 



<400> 26 

aauaugucac uuucauaaag c 21



<210> 27 

<211> 14 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> primer_bind 
<222> (0) . . . (0) 

<223> Exon overlapping oligonucleotide primer F26 
<400> 27 

agacggctct cacc 14 

<210> 28 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> primer_bind 
<222> (0)...(0) 

<223> Exon overlapping oligonucleotide primer FDP5
<400> 28

cacttgtcac acacataaag g 21

<210> 29

<211> 21

<212> DNA

<213> Homo sapiens

<220>

<221> primer_bind
<222> (0)...(0)

<223> cDNA oligonucleotide primer FDP5
<400> 29

cctttatgtg tgtgacaagt g 21

<210> 30

<211> 22

<212> DNA

<213> Homo sapiens

<220>

<221> primer_bind
<222> (0)...(0)

<223> cDNA oligonucleotide primer FIO
<400> 30

atccagccag gatgaaatag aa 22



<210> 31 
<211> 20 
<212> DNA
<213> Homo sapiens
<220>

<221> primer_bind
<222> (0)...(0)

<223> upstream primer human VH family leader sequence
VH1

<400> 31 

ccatggactg gacctggagg 20 

<210> 32 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> primer_bind 
<222> (0) . . . (0) 

<223> upstream primer human VH family leader sequence 
VH2 

<400> 32 

atggacatac tttgttccag c 21 

<210> 33 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> primer_bind 
<222> (0) . . . (0) 

<223> upstream primer human VH family leader sequence 
VH3 

<400> 33 

ccatggagtt tgggctgagc 20 

<210> 34 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> primer_bind 
<222> (0) . . . (0) 

<223> upstream primer human VH family leader sequence 
VH4 

<400> 34 

atgaaacacc tgtggttctt 20 

<210> 35 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> primer_bind 
<222> (0) . . . (0) 






<223> upstream primer human VH family leader sequence 
VH5 

<400> 35 

atggggtcaa ccgcgatcct 20 

<210> 36 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> primer_bind 
<222> (0) . . . (0) 

<223> upstream primer human VH family leader sequence 
VH6 

<400> 36 

atgtctgtct ccttcctcat 20 

<210> 37 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> primer_bind 

<222> (0) . . . (0) 

<223> downstream primer C1

<400> 37 

gaggctcagc gggaagacct t 21 

<210> 38 

<211> 21 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> primer_bind 

<222> (0) . . . (0) 

<223> downstream primer C2

<400> 38 

ggggaagacc gatgggcccc t 21 

<210> 39 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> primer_bind

<222> (0) . . . (0) 

<223> northern blot primer 

<400> 39 

tcacctggga gctcagagga 20 



<210> 40 
<211> 20
<212> DNA

<213> Homo sapiens 

<220> 

<221> primer_bind 

<222> (0) . . . (0) 

<223> northern blot primer 

<400> 40 

gtgatcctgg gagaatctct 20 

<210> 41 
<211> 25 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> primer_bind 

<222> (0) . . . (0) 

<223> RACE antisense primer



<400> 41 

tacattacca acacacgcgc aacag 25



